Gene TM1040_2132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2132 
Symbol 
ID4076446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2236933 
End bp2238237 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content62% 
IMG OID638007452 
Productcapsule polysaccharide biosynthesis 
Protein accessionYP_614126 
Protein GI99081972 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3562] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATGA GATCGAAAAA TGGATCAGCT GACGCCCGCA GCTTCCTGTT TCTGCAGGGG 
CCTCATGGGC CGTTTTTTGC AGCTCTGGGG CGTATGTTGC GCGCAGCGGG CTGCAGCGTG
CACCGGGTTG GGTTCAACAC GGGCGACCGC GTGTTCTGGC CGGACCGGCA AAGCTATATC
GCCTATCGCG ATACACCAGA GGCCTGGCCG GAAAGGTTTG GCGCGCTCCT GGATCAGCGC
GCGATCACCG ATATCGTGCT CTATGGCGAC ACCCGCCCGA TCCATGCGGA TGCAATCGCG
ATCGCAAAGG CACGCGGTAT CACAGTCCAT ACGTTCGAAG AAGGCTATCT GCGCCCCTAC
TGGGTCACCT ATGAGCGCGA CGGCACCAAT GGCAACTCGC GCCTGATGCA GATGCCGGTC
CCTGAGATGG AGCAAGCGCT GGCAAACAGC GACCTCGAAA TGCCAATGCC TCCGGCGCAT
TGGGGCGACA CACGCCAACA TGTGATTTAC GGCGCGCTTT ATCATTGGTT CGTCATGTTT
CTGAACCGGG GCTATCGCAA CTTCCGCCCC CATCGCGCCC TGCCGGTCAC AAAGGAGTTC
CAGCTCTATC TCAACCGGCT GCTGATGATG CCCTTGCATG CCTTGCACCG CCGTCTGGCC
ACGATGCGCA TTCAGCGCGG CGGATTTCCC TATCATCTGG CGCTCTTGCA GCTGGAACAT
GACAGCGCGT TTCAAGCCCA TTCGCCTTTT TCGACCATCA CCGAATTCCT CGAGGTGGTT
GTTGCCGGCT TTGCCGAGGG CGCGCCGCGC CACCACCACC TTGTCTTCAA GGCGCACCCG
CTTGAGGATG GACGCGCCCC GATCCGACGG GCGTTGAAAC GTCTGGCCGC GGAACATGGC
GTCGAGGGGC GCGTGCACTA TGTGCGCGGC GGCAAGCTCG CCGCGCTTTT GAATGAGGCG
CGCACCGCCG TCACCGTGAA CTCCACGGCA GGCCAACAGG TCTTGTGGCG GGGCTTGCCG
CTCAAGGTCT TTGGCCGCGC GGTTTATGAC AAACCGGAAT TCTCCTCGAC CCAAAGCCTG
CCGGAGTTCT TTGCCGCTCC CGCCCGCCCC GATGGGCGCG CCTACAAGCA ATATCGCCGC
TATCTTTTGG AAACCTCTCA GTTTCCTGGC GGGTTTTATT CGCGCTCCGG GCGGCGCCAG
CTGTTGCGGC AAGTGGTGGA TATGATGCTT GCTGCCGACG ATCCCTATGA CGCGCTCCTG
CGCGGCACAG CAGCGCCACG GCAACACTTG CGTGTCGTGA GCTGA
 
Protein sequence
MPMRSKNGSA DARSFLFLQG PHGPFFAALG RMLRAAGCSV HRVGFNTGDR VFWPDRQSYI 
AYRDTPEAWP ERFGALLDQR AITDIVLYGD TRPIHADAIA IAKARGITVH TFEEGYLRPY
WVTYERDGTN GNSRLMQMPV PEMEQALANS DLEMPMPPAH WGDTRQHVIY GALYHWFVMF
LNRGYRNFRP HRALPVTKEF QLYLNRLLMM PLHALHRRLA TMRIQRGGFP YHLALLQLEH
DSAFQAHSPF STITEFLEVV VAGFAEGAPR HHHLVFKAHP LEDGRAPIRR ALKRLAAEHG
VEGRVHYVRG GKLAALLNEA RTAVTVNSTA GQQVLWRGLP LKVFGRAVYD KPEFSSTQSL
PEFFAAPARP DGRAYKQYRR YLLETSQFPG GFYSRSGRRQ LLRQVVDMML AADDPYDALL
RGTAAPRQHL RVVS