Gene Swit_4032 details

Gene Information

Locus tag: Swit_4032
Symbol:
ID: 5198829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingomonas wittichii RW1
Kingdom: Bacteria
Replicon accession: NC_009511
Strand:
Start bp: 4434835
End bp: 4436358
Gene Length: 1524 bp
Protein Length: 507 aa
Translation table: 11
GC content: 68%
IMG OID: 640583589
Product: PEP-CTERM locus polysaccharide chain length determinant
Protein accession: YP_001264514
Protein GI: 148556932
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
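
The coordinates and lengths in the record agree with each other, and a few lines of arithmetic show why. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information fields.
start_bp = 4434835
end_bp = 4436358

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1524 507
```

Under those assumptions the result matches the stated Gene Length (1524 bp) and Protein Length (507 aa).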


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.077437
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.946432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCCA TCTACACCGA GATCAGGATT GCCCTGTACG CGATCTGGCG GATGCGCTGG 
CTGGCGCTCG CCGTCGCCTG GGCCTTCTGC CTGGTCGGCT GGGTGATGGT GATGCGGGTC
CCCGCCGCCT ATGAGAGCAG CGCGCGCATC CAGGTCCAGG TCAAGTCGCT GTTCGCCGAC
GGGGCCGAGA ACGACATGCA GCGCAATGTC GATCGCGTCC GCCAGTCGCT GACCTCGACC
GAGATATTGA AGCGGGTCGT CCGCACCGTC TCCAACGGCG AGCGGCCGCT GACCTCCTAC
GAGTCCCTGT CGCTGATCGG CGCGCTGCGC TCGGGCATCT CCATCACCGC GCAGGGCGAC
GACCTGCTCG AGATCAAGAC GCGGATCAGC CTGAAGGGCA TTTCCGAGCA GCAGACCGCC
GCGATCGCCC GCAACGTCAC CCAGAAGCTG ATCGAGATAT TCATCCAGGA GAATGTGATC
GGCAGCAAGG CGAACAACGC CGAGACGCTG AGCTTCCTCG ACCAGGAACT GGCCCGCCGC
GCCAAGGAGC TGGCCGAGGT CGACCGCCAG CGCGCGCAGA TCACCCAGAA CACGCTCGGC
TCGCTGCCGG GCACGGGCTC GCTCGACCAG CGGATGGACG CCGCGCGCAA CGAGATGGTC
AATCTCGATT CGAACCTGAT GCAGGCGCGC AGCGCGCTGG CGGCGATGAA CGGCCAGCTC
GCCGCCACCC CGGCGCAGAT ACCGGCGGGC AGCGTCAACG GCATCGACAC GCTCGACCAG
CGCATCGGGA CGCTCGAAGG GCAGCTTTCG GAAGCGATCT CGCGCGGCTG GACCGAGAAG
CATCCCGACG TCATCGCGAT CCGTTCCCAG CTCAAGCAGC TCCGCGCCGA AAAGGCGCGC
GGCGGCAGCC GGTCGGTGCC GATGGCGCCC AACCCCGTCT ATGTTTCGCT CAAGTCGATG
CAGGCCGAGC GCGAGGGCAA TGTCGCGGCG CTCCAGGCGC GCAAGTCGCA GCTCGAACAG
GCGGTGACGA CGATCGCGCA GCGCCAGCTC ACCGCGCCCG GTGCCCAGAT CGACCAGGCC
CGGCTCGACC GCGACTATGA CGTGCTCAAG GACCAGTACA ACAAGCTGCT CGCCCAGCGC
GAGTCCGCGA AGCTGCGGTC CGACGCGACC GGCAAGACCG ACGGCGTCCA GTTCCGGGTG
ATCGACCAGC CGTCGCTGCC GCGCCAGCCG GCGGCGCCCA ACCGGGCGAT GCTGCTCACC
GGCGTGCTGG TCGCCGGCAT CGTGATCGGC GCGGGCGTGG CCTTCGCCAA GAGCCAGCTC
GCCAATGTCT ACACGACCAC CCAGCAGCTC GCGAAGGCCA GCGGCCTGCC GGTGCTCGGG
TCGATCTCGG AGGTGATCAC CGCCGACACG CGGGTGATCC GGCGCAAGCA GATGATGTGG
TTCTCCTCGG CCGCCGCGGG GCTGCCGGGG ATGTTCATGC TGCTGATGCT GATCGAGTTC
GTCAAACGCA TCATGGTGTC CTGA
 
Protein sequence
MEAIYTEIRI ALYAIWRMRW LALAVAWAFC LVGWVMVMRV PAAYESSARI QVQVKSLFAD 
GAENDMQRNV DRVRQSLTST EILKRVVRTV SNGERPLTSY ESLSLIGALR SGISITAQGD
DLLEIKTRIS LKGISEQQTA AIARNVTQKL IEIFIQENVI GSKANNAETL SFLDQELARR
AKELAEVDRQ RAQITQNTLG SLPGTGSLDQ RMDAARNEMV NLDSNLMQAR SALAAMNGQL
AATPAQIPAG SVNGIDTLDQ RIGTLEGQLS EAISRGWTEK HPDVIAIRSQ LKQLRAEKAR
GGSRSVPMAP NPVYVSLKSM QAEREGNVAA LQARKSQLEQ AVTTIAQRQL TAPGAQIDQA
RLDRDYDVLK DQYNKLLAQR ESAKLRSDAT GKTDGVQFRV IDQPSLPRQP AAPNRAMLLT
GVLVAGIVIG AGVAFAKSQL ANVYTTTQQL AKASGLPVLG SISEVITADT RVIRRKQMMW
FSSAAAGLPG MFMLLMLIEF VKRIMVS
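
The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as starts. Only the first and last lines of the gene sequence are used here; pasting the full sequence into the helpers works the same way.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
head = "ATGGAAGCCA TCTACACCGA GATCAGGATT"
tail = "GTCAAACGCA TCATGGTGTC CTGA"

print(translate(head))  # MEAIYTEIRI
print(translate(tail))  # VKRIMVS
```

Both outputs match the start and end of the protein sequence listed above. Calling `gc_fraction` on the full gene sequence gives a value to compare with the GC content field.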