Gene TM1040_2125 details

Gene Information

Locus tag: TM1040_2125
Symbol:
ID: 4076439
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2230638
End bp: 2232287
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 60%
IMG OID: 638007445
Product: choline/carnitine/betaine transport
Protein accession: YP_614119
Protein GI: 99081965
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
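
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 2230638
end_bp = 2232287

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1650 549
```

Both results agree with the Gene Length and Protein Length fields listed in the record.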


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACC AAACGACGGA AACGCCCGAC GGCATCCCAA GCCCGGACGG CGCAGCACAT 
ATCATTGATA CGGATTATGA AATCGGGCAG GACAACGTCG AGGGGTCCGT TGGCCCCTTT
GGCTTTGACA TCCACAACCC GGTGTTTGCC ATCTCCGGGA TCGCGGTTGT GGCCTTCGTG
TTCTACACGC TCGCACTGCC CGAGCAGGCC GGCAACGTGT TCTCCTGGCT CTTTAGCGCA
GTGACCAAGG GCTTTGACTG GTTCTTCCTC GGTGCCGCAA ATATCTTTGT GATCTTCTGC
CTGTTCCTGA TCGTGACGCC CTTTGGCAAT GTGAGGCTCG GCGGCACGGA GGCAGAACCC
GACTATAGCT ACATTGGCTG GTTTGCGATG CTGTTTGCAG CCGGTATGGG CATCGGTTTG
ATGTTCTATG GGGTTTCCGA ACCTCTGAGC CATTTTGCCT CTTCCATCGG GGGCACCGCA
TCCGAGGGCG GCGTGCGTAC CGACTGGGCA CCACTTGGGG CCGCAGGTGG TAATGAAGCC
GAGGCCGTGC GCCTCGGGAT GGCGGCGACG ATCTTTCACT GGGGGCTTCA CCCTTGGGCG
ATCTATGCCG TTGTGGCGCT TGCGCTGGCC TTGTTCAGCT ACAACAAGGG CCTGCCGCTC
ACGATCCGGT CTGCTTTCTA TCCGATTTTT GGCGATGCCG TCTGGGGTTG GGTTGGCCAT
GTGATCGACA TCCTTGCTGT CTTTGCGACC CTCTTTGGCC TGGCAACTTC GCTCGGATTT
GGCGCAACGC AAGCCAATGC GGGCCTCAAT GAGCTGTTTG GCATCTCGAT CGGGTCCACC
ACTGAGGTCA TCCTGATCTC GGCCATCACC GCAATCGCAC TGGTTTCTGT GCTGCGCGGC
CTTGATGGCG GTGTGAAGAT CCTGTCGGAG ATCAATATGG GTCTGGCCTT TGTGCTGCTG
GTCTTTGTGC TGCTGGTGGG GCCGACAGTG TTCCTGCTGT CGCTGTTCTG GGACTCGCTG
ATGGCCTATT TCCAGTATCT CCCGGCGCTC TCGAACCCGG TCGGACGTGA GGATGTCAAC
TTCATGCAGG GCTGGACGTC GTTCTACTGG GCGTGGTGGA TTTCCTGGTC GCCGTTCGTG
GGCATGTTCA TCGCCCGCGT CAGCCGTGGC CGGACCGTGC GTGAATTCAT CATCTGCGTG
CTGTTGATCC CGTCGATGGT CTGCGTCCTT TGGATGAGCG TATTCGGCGG CACTGCGATC
CATCAGGTTC TCTCTGATGG CTACACCGTA GCGCAGGATG CGGCGCTGGA GCTGAAGCTC
TTCAAGATGC TGGATGTGAT GCCGCTTGCC GGGATCACCT CGCTGGTCGG CATCGTGCTG
GTGATCGTGT TCTTCGTGAC CTCCTCGGAC TCGGGGTCGC TGGTGATCGA CACCATCACC
GCTGGCGGCA AGGTAGATGC GCCGGTACCG CAGCGGGCCT TCTGGTGCGT CTTTGAAGGT
GCGGTTGCGA TCGTTCTCCT CCTGAGCGCC GGCGGCCTGC AATCGTTGCA GTCCATGGTG
ATCTCTACGG GCCTGCCGTT CACGGTTGTG CTGTTGGTGA TGTGCTATGC AATCCTGCGC
GGCCTGATGA GCGAGCGAAA GGCGACCTAA
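
The gene sequence is displayed in space-separated blocks of 10 bases, 60 bases per line. A minimal sketch of how such text might be cleaned up and its GC content computed follows; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used here, so the result is not the whole-gene figure.

```python
def clean_sequence(text):
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence (60 bases).
first_line = "ATGACCGACC AAACGACGGA AACGCCCGAC GGCATCCCAA GCCCGGACGG CGCAGCACAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_percent` gives a value that can be compared with the GC content field in the record.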
 
Protein sequence
MTDQTTETPD GIPSPDGAAH IIDTDYEIGQ DNVEGSVGPF GFDIHNPVFA ISGIAVVAFV 
FYTLALPEQA GNVFSWLFSA VTKGFDWFFL GAANIFVIFC LFLIVTPFGN VRLGGTEAEP
DYSYIGWFAM LFAAGMGIGL MFYGVSEPLS HFASSIGGTA SEGGVRTDWA PLGAAGGNEA
EAVRLGMAAT IFHWGLHPWA IYAVVALALA LFSYNKGLPL TIRSAFYPIF GDAVWGWVGH
VIDILAVFAT LFGLATSLGF GATQANAGLN ELFGISIGST TEVILISAIT AIALVSVLRG
LDGGVKILSE INMGLAFVLL VFVLLVGPTV FLLSLFWDSL MAYFQYLPAL SNPVGREDVN
FMQGWTSFYW AWWISWSPFV GMFIARVSRG RTVREFIICV LLIPSMVCVL WMSVFGGTAI
HQVLSDGYTV AQDAALELKL FKMLDVMPLA GITSLVGIVL VIVFFVTSSD SGSLVIDTIT
AGGKVDAPVP QRAFWCVFEG AVAIVLLLSA GGLQSLQSMV ISTGLPFTVV LLVMCYAILR
GLMSERKAT
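
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its set of allowed start codons, which does not matter here because this gene begins with ATG). The `translate` helper is hypothetical, and only the first 30 and last 30 bases of the gene are translated as a spot check.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases of the gene sequence.
print(translate("ATGACCGACC AAACGACGGA AACGCCCGAC"))  # MTDQTTETPD
# Last 30 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GGCCTGATGA GCGAGCGAAA GGCGACCTAA"))  # GLMSERKAT*
```

Both fragments match the start and end of the listed protein sequence, with the trailing `*` corresponding to the stop codon that is excluded from the 549 aa protein.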