Gene TM1040_0422 details


Gene Information

Locus tag: TM1040_0422
Symbol:
ID: 4076182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 432884
End bp: 434839
Gene Length: 1956 bp
Protein Length: 651 aa
Translation table: 11
GC content: 60%
IMG OID: 638005717
Product: BCCT transporter
Protein accession: YP_612417
Protein GI: 99080263
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
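
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(432884, 434839)
print(length)                  # 1956
print(protein_length(length))  # 651
```

Both values agree with the Gene Length and Protein Length fields of this record.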


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0280485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.480367
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGTTT TTTGTGCAAA GTTCGCGTAC GGGCGCGATG CTGCGGCGCG GCGTGATTGG 
ATCTGCGGCT TTGCGACGTG GCTCTCAGCT TCGCGACATC CCCCCGAATT TCAGTCCCAG
CGGATGCCGC ATCGACGAGT TTTCCTGTCG GATCGCAGTT GGCGCAGGTG GCGCCAAATC
GAGACGATCG CGCAATCCAC CTGCAGGCTT TTGCTGCGGT GGCAACCAAG ATTGCGCCCG
GCACATTCGC CCGAGGTCGC CTTTCCCCAA CAGAGAGGAC CAACCACCAT GCCGCTTAAA
CCGCCTTTGA CGGAGCTTGA GATCAAGACT GCCGACAGCG GATTCTACAA AGGGTTTACC
AAGGATGTGA CAATCACTGC CAAGATCCTT GTTGCCGCCC TGATCCTCTG GGCCGTGGCC
TTTCCAAATC AGGCGGCCAG CGTGCTGTCG CACCTAAACG GGGTCATTCT GGCCAGCTTC
AACTACTGGT ACATCTACGT GGTCGCCTTT TTTGTGGTGG TGAGCCTGGG GCTTGCGATC
TGGCCTGCAT CCGGACGTCT GCGGCTCGGA GCGGAGCATG AAAAGCCAGA GTTCTCGGCC
TTCTCCTGGT TCTCCATGAT GTTCGGCGCT GGCATGGGAA TTGGTGTTCT GACCTATGCC
ACCGGTGAGC CGATCTATCA CTTCCAGAAC AACCCGGATG TCATCCAGGG TTTTGTCGAA
GGGGCCAGCG CCGAGACGGT GCGCTCTGCC TACAAGTGGT CCTTTTTGCA TTGGGGTCTG
TCTCCATGGG GGGCCTATGC GTTGACCGGG CTTGCGCTCG CGTTCTTCGC TTACCGCCGC
GGGCTCCCGC TGACCATTCG GTCGGCGCTG ACGCCTCTGT TCGGCGATCG GCTGTCGGGT
TTTGCGGGTC ATGCCGTCGA TGTGGTGGCC GTCATCGCGA CCGTGCTGGG GGTGGCGCAA
ACGCTCGGCT TTGGGGTCGA GCAATTCGTC GCGGGGCTGA GCCGCATCGG CTTTGGCGAC
TGGCTTCTTG TCACAACCGA TGAAGGAACC CAAAAGGCGT CTGTGGTGGC GATCGTGCTG
TCGCTGGTGA TCATCATGGG GGCTTCGACG CTGTCGGCCC TGTCGGGTGT CGGCAAGGGG
ATCAAATGGC TCTCCAACAT CAACATGGGG TTGAGCGTGT TCTTGCTGTC GTTTTTCCTG
ATCTTTGGCT CCACGGTTTT TGGCCTCACG GCGCTGGTGA CAGGGATCTA TGACTATCTC
CTGGCTTTTC CCGCGATGCT GTTCACCGTC TGGAGCGACA AAGGCACCGA GACCAGCAGC
GCACTTGAGA GCTGGCAGGG TGGTTGGACC ATTTTCTACT GGGCTTGGTG GATTGCGTTT
GCTCCCTTCG TCGGTTTGTT CCTCGCCCGG ATCTCTCGAG GTCGGACGAT CCGCGAATAT
GTCTTTGGCG CGATGCTGGT ACCATCCGTC ATGTGTTTTG TCTGGTTCGC CCTGATTGGC
GGCACCGCCA TCGATCTGGA ACTGTCGGGC ATCGCAGAGG GTGCAATCCT TGGAACGGGG
CTGTCAGACC AGCTCTATGC AACCCTCGCA GTTCTCCTCA GCGATGGTCT GGCCTGGGTG
TTCTCGGTGG TCGTCGCGGT TCTGTTGATG ACCTACCTTG TCACCTCGGC GGATTCGGCG
GTGTTGATCA TCAACACCAT CAACGCCGCA GGTGACGAGG GACCAAAGGC ACGTCCTCAT
ATCATCTTCT GGGGCGTCGC CTTGGCGTTG GTTGTCGGAG CTTTGCTGAT CATTGGTGGT
CTCGGTGCGA TCCAGACCGC GATGATTATC GGCGCGCTGC CCTTCTCGGT GGTGATGGCG
TTGATGGGCG TTGCCTTGAT CAAGGCCATC ATCCTCGACG GGCTTCGGGG CCGCGCGGGA
CTGCCAGTTA CGGCAGATCA GCTCGCGGGC GAGTAG
 
Protein sequence
MPVFCAKFAY GRDAAARRDW ICGFATWLSA SRHPPEFQSQ RMPHRRVFLS DRSWRRWRQI 
ETIAQSTCRL LLRWQPRLRP AHSPEVAFPQ QRGPTTMPLK PPLTELEIKT ADSGFYKGFT
KDVTITAKIL VAALILWAVA FPNQAASVLS HLNGVILASF NYWYIYVVAF FVVVSLGLAI
WPASGRLRLG AEHEKPEFSA FSWFSMMFGA GMGIGVLTYA TGEPIYHFQN NPDVIQGFVE
GASAETVRSA YKWSFLHWGL SPWGAYALTG LALAFFAYRR GLPLTIRSAL TPLFGDRLSG
FAGHAVDVVA VIATVLGVAQ TLGFGVEQFV AGLSRIGFGD WLLVTTDEGT QKASVVAIVL
SLVIIMGAST LSALSGVGKG IKWLSNINMG LSVFLLSFFL IFGSTVFGLT ALVTGIYDYL
LAFPAMLFTV WSDKGTETSS ALESWQGGWT IFYWAWWIAF APFVGLFLAR ISRGRTIREY
VFGAMLVPSV MCFVWFALIG GTAIDLELSG IAEGAILGTG LSDQLYATLA VLLSDGLAWV
FSVVVAVLLM TYLVTSADSA VLIINTINAA GDEGPKARPH IIFWGVALAL VVGALLIIGG
LGAIQTAMII GALPFSVVMA LMGVALIKAI ILDGLRGRAG LPVTADQLAG E
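
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before further use. The sketch below shows one way to strip the spacing, compute the GC fraction, and translate the gene sequence. It is an illustration under stated assumptions: the helper names are hypothetical, the codon table is built by hand rather than taken from a library, and only codon-to-amino-acid assignments are modelled (for translation table 11 these match the standard code; alternative start codons are not handled, which does not matter here because the gene starts with ATG).

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and
# for translation table 11 (the tables differ only in allowed start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCCGTTT TTTGTGCAAA GTTCGCGTAC"))  # MPVFCAKFAY
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` can be compared against the GC content field.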