Gene EcolC_3309 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3309 
Symbol 
ID6067154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3625545 
End bp3627578 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content55% 
IMG OID641602725 
Productcholine transport protein BetT 
Protein accessionYP_001726258 
Protein GI170021304 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGACC TTTCACACAG CAGGGAAAAG GACAAAATCA ATCCGGTGGT GTTTTACACC 
TCCGCCGGAC TGATTTTGTT GTTTTCCCTG ACAACGATCC TGTTTCGCGA CTTCTCGGCC
CTGTGGATTG GCCGCACGCT GGACTGGGTT TCTAAAACCT TCGGTTGGTA CTATCTGCTG
GCGGCAACGC TCTATATTGT CTTTGTGGTC TGTATCGCTT GTTCGCGTTT TGGTTCGGTG
AAGCTCGGGC CAGAACAATC CAAACCGGAA TTCAGCCTGC TGAGTTGGGC GGCGATGCTG
TTTGCTGCCG GGATCGGTAT CGACCTGATG TTCTTCTCCG TAGCCGAACC GGTAACGCAG
TATATGCAGC CGCCGGAAGG CGCGGGACAG ACGATTGAGG CCGCGCGTCA GGCGATGGTC
TGGACGCTGT TTCACTACGG CTTAACCGGC TGGTCGATGT ATGCGCTGAT GGGCATGGCG
CTCGGATACT TTAGCTATCG TTATAATTTG CCGCTCACCA TCCGCTCGGC GCTGTACCCG
ATCTTCGGTA AACGGATTAA CGGGCCGATA GGGCACTCAG TGGATATTGC AGCGGTGATC
GGCACCATCT TCGGTATTGC CACTACGCTC GGTATCGGTG TGGTGCAGCT TAACTATGGC
TTGAGCGTAC TGTTTGATAT TCCCGATTCG ATGGCGGCAA AAGCGGCACT GATCGCCTTG
TCGGTGATAA TCGCCACGAT CTCTGTCACC TCCGGTGTCG ATAAGGGGAT TCGTGTGTTA
TCGGAACTTA ACGTTGCGCT GGCGCTGGGA TTGATCCTGT TCGTATTGTT TATGGGCGAC
ACCTCGTTCC TGCTTAATGC GCTGGTGCTG AATGTTGGCG ACTATGTGAA TCGCTTTATG
GGCATGACGC TCAACAGTTT TGCCTTCGAC CGTCCAGTTG AGTGGATGAA TAACTGGACG
CTCTTCTTCT GGGCATGGTG GGTGGCATGG TCGCCGTTTG TCGGCTTGTT CCTGGCGCGT
ATCTCGCGTG GGCGTACCAT TCGCCAGTTC GTGCTGGGCA CGTTGATTAT TCCGTTTACC
TTCACGCTGT TATGGCTCTC GGTGTTCGGC AATAGCGCGC TGTATGAAAT CATCCACGGC
GGAGCGGCAT TTGCCGAGGA AGCGATGGTC CATCCGGAGC GCGGCTTCTA CAGCCTGCTG
GCGCAGTATC CGGCGTTTAC CTTTAGCGCC TCCGTCGCCA CCATTACTGG CCTGCTGTTT
TATGTGACCT CGGCGGACTC CGGGGCGCTG GTGCTGGGGA ATTTCACCTC GCAGCTTAAA
GATATCAACA GCGACGCCCC CGGCTGGCTG CGCGTCTTCT GGTCGGTGGC GATTGGCCTG
CTGACGCTCG GCATGCTGAT GACCAACGGG ATATCCGCGC TGCAAAACAC CACGGTGATT
ATGGGGCTGC CGTTCAGCTT TGTGATCTTC TTCGTGATGG CGGGGTTGTA TAAATCTCTG
AAGGTAGAAG ATTACCGCCG TGAAAGTGCC AACCGCGATA CCGCACCGCG ACCGCTGGGG
CTTCAGGATC GCCTGAGCTG GAAAAAACGT CTCTCGCGCC TGATGAATTA TCCGGGCACG
CGTTACACTA AACAGATGAT GGAGACGGTC TGTTACCCGG CAATGGAAGA AGTGGCGCAG
GAGTTGCGGT TGCGCGGCGC GTACGTGGAG CTAAAAAGCC TGCCACCGGA AGAGGGACAG
CAGTTGGGTC ATCTGGATTT GTTGGTGCAT ATGGGCGAAG AGCAAAACTT TGTCTATCAG
ATTTGGCCGC AGCAATATTC GGTGCCGGGC TTTACCTACC GCGCACGCAG CGGTAAATCG
ACCTACTACC GGCTGGAAAC CTTCCTGTTA GAAGGCAGCC AGGGCAACGA CCTGATGGAC
TACAGCAAAG AGCAGGTGAT CACCGATATT CTTGACCAGT ACGAGCGGCA CCTTAACTTT
ATTCATCTCC ATCGTGAAGC GCCGGGCCAT AGCGTGATGT TCCCGGACGC GTGA
 
Protein sequence
MTDLSHSREK DKINPVVFYT SAGLILLFSL TTILFRDFSA LWIGRTLDWV SKTFGWYYLL 
AATLYIVFVV CIACSRFGSV KLGPEQSKPE FSLLSWAAML FAAGIGIDLM FFSVAEPVTQ
YMQPPEGAGQ TIEAARQAMV WTLFHYGLTG WSMYALMGMA LGYFSYRYNL PLTIRSALYP
IFGKRINGPI GHSVDIAAVI GTIFGIATTL GIGVVQLNYG LSVLFDIPDS MAAKAALIAL
SVIIATISVT SGVDKGIRVL SELNVALALG LILFVLFMGD TSFLLNALVL NVGDYVNRFM
GMTLNSFAFD RPVEWMNNWT LFFWAWWVAW SPFVGLFLAR ISRGRTIRQF VLGTLIIPFT
FTLLWLSVFG NSALYEIIHG GAAFAEEAMV HPERGFYSLL AQYPAFTFSA SVATITGLLF
YVTSADSGAL VLGNFTSQLK DINSDAPGWL RVFWSVAIGL LTLGMLMTNG ISALQNTTVI
MGLPFSFVIF FVMAGLYKSL KVEDYRRESA NRDTAPRPLG LQDRLSWKKR LSRLMNYPGT
RYTKQMMETV CYPAMEEVAQ ELRLRGAYVE LKSLPPEEGQ QLGHLDLLVH MGEEQNFVYQ
IWPQQYSVPG FTYRARSGKS TYYRLETFLL EGSQGNDLMD YSKEQVITDI LDQYERHLNF
IHLHREAPGH SVMFPDA