Gene TM1040_3298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3298 
Symbol 
ID4075701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp303158 
End bp304633 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content58% 
IMG OID638004805 
ProductBCCT transporter 
Protein accessionYP_611532 
Protein GI99078274 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.183438 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGATA ACAACTCCAA GAGCCACCTG GTGGCGAAGT CCGGCTTCTT TGCCGGTCTC 
CATCCGGGGA TGGGCACTGC CGCAAAAGGA ATGATCGCCT TTTTTGTCAT CTTCACCGTG
TTGAACGTGG ACTTGGCAAG CGGGCTATAC GCGTCCGTGC GCGCCTGGAT CGAAGGCTGG
TTCAGCTGGT ATTACATTTC GCTGCTGATC GGCGTGATGT TCTTCTGCTT CTGGCTGATG
GGGTCGCGAT TTGGCAAGTT GCGGCTTGGA GAGGACGACT CGCGCCCCGA GTTTGGCAAG
TTCAGCTGGT TTGCGATGCT GTTTTCGGCC GGGATCGGTG TCGGCATCCT GTTCTTTGGC
GTGGCAGAGC CGATCTTCTA CTTCGACAAT TCCGGCGCCT TTGGGTATCC CAATAACCCA
CACGCCGATA TGGCGGGCCA TATGGACATG GACATGTTGC GCGCCCGGGA CGCGATGCGG
GTGGCGTTCT TCCACTGGGG TTTTCACGGT TGGGCGCTGT ATGTGCTGGT CGGCTTGTGT
CTCGCCTATT TCGGGTTTCG CAAACGGCTT CCTCTGACCA TGCGCTCCAC GCTCTATCCG
ATCCTAGGCG AGCGGATTTA CGGCCCGTGG GGGCATCTTG TGGACTTGCT GGCGGTGTTT
GGTTCGGTGT TTGGCATTGC AACCTCGCTG GGACTGGGCT CCACGCAGAT CGCGACGGGA
TTGAACGTGC TGTTCGGATT GGAAGTAACC CTGACCCTGA AAATCGTGCT TATCGTGATC
GTATCTGTCA TCGCCACTGC CTCGACCGTA TCAGGCGTTG CCCGCGGCAT CCGCTTTATC
TCGGAATGGA ACATCTGGCT TTCGATCGCT CTGCTCTTGG GGTTCCTGGT TTTGGGACCA
GCAAAATGGC TGATGGCGTT CTTCATCACC TCAGTCGGGG ACTACCTGTG GCACTTCATC
CCGATGGGAT ACTGGACCGC AACCGAAGAA CCAAATGTCG CCTGGCAGGG CGGTTGGACC
ATCTTCTATT GGGGGTGGTG GATCGCCTGG GCACCGCTGG TGGGCATGTT TATCGCGCGG
ATTTCCTATG GCCGCACCAT CCGTGAGTTC ATGGTTGGGG TGCTCTGCGT GCCGACCATT
GCCGTCTTCT TCTGGCTCTG CATCTTCGGC GGCACCGCGA TCTGGCAAGA GATGCACATG
GCCGGTGGAC CAGCGGCTGA AGGCGGCGCA GGCATCATTG CGACCGTCCG CAATTGGGAG
TTGCCTGGAG CGCTTTACGG CACCATTGCC AACATCGGCG AGACGTCCTG GATGGGTGAC
ATGAGCTGGA CCGCCTGGCC GATGTCGCTG CTGGCCAACC TCTCCATGTT TGAGCGTGGC
GCGAAGGATC TTGTGGTCTT TATGGACGAT CTGGAGAGTG TGGTTGGGTC GGGCCGCAAG
CTCACGGCAA ACGAGCTTCG CAAACGCATT CAGTGA
 
Protein sequence
MSDNNSKSHL VAKSGFFAGL HPGMGTAAKG MIAFFVIFTV LNVDLASGLY ASVRAWIEGW 
FSWYYISLLI GVMFFCFWLM GSRFGKLRLG EDDSRPEFGK FSWFAMLFSA GIGVGILFFG
VAEPIFYFDN SGAFGYPNNP HADMAGHMDM DMLRARDAMR VAFFHWGFHG WALYVLVGLC
LAYFGFRKRL PLTMRSTLYP ILGERIYGPW GHLVDLLAVF GSVFGIATSL GLGSTQIATG
LNVLFGLEVT LTLKIVLIVI VSVIATASTV SGVARGIRFI SEWNIWLSIA LLLGFLVLGP
AKWLMAFFIT SVGDYLWHFI PMGYWTATEE PNVAWQGGWT IFYWGWWIAW APLVGMFIAR
ISYGRTIREF MVGVLCVPTI AVFFWLCIFG GTAIWQEMHM AGGPAAEGGA GIIATVRNWE
LPGALYGTIA NIGETSWMGD MSWTAWPMSL LANLSMFERG AKDLVVFMDD LESVVGSGRK
LTANELRKRI Q