Gene ECH74115_0376 details


Gene Information

Locus tag: ECH74115_0376
Symbol: betT
ID: 6967316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 381007
End bp: 383040
Gene Length: 2034 bp
Protein Length: 677 aa
Translation table: 11
GC content: 55%
IMG OID: 643384431
Product: choline transport protein BetT
Protein accession: YP_002268946
Protein GI: 209395783
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
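
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and do not come from the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(381007, 383040)
    print(length)                  # 2034
    print(protein_length(length))  # 677
```

Both results agree with the Gene Length and Protein Length fields of this record.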


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.0597337
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGACC TTTCACACAG CAGGGAAAAG GACAAAATCA ATCCGGTGGT GTTTTACACC 
TCCGCCGGAC TGATTTTGTT GTTTTCCCTG ACAACGATCC TGTTTCGCGA CTTCTCGGCC
CTGTGGATTG GCCGCACGCT GGACTGGGTT TCTAAAACCT TCGGTTGGTA CTATCTGCTG
GCGGCAACGC TCTATATTGT CTTTGTGGTC TGTATCGCTT GTTCGCGTTT TGGTTCGGTG
AAGCTCGGGC CAGAACAATC CAAACCGGAA TTCAGCCTGC TGAGTTGGGC GGCGATGCTG
TTTGCTGCCG GGATCGGTAT CGATCTGATG TTCTTTTCCG TAGCCGAACC GGTAACGCAG
TATATGCAGC CGCCGGAAGG CGCGGGACAG ACGATTGAGG CCGCGCGTCA GGCGATGGTC
TGGACGCTGT TTCACTACGG CTTAACCGGC TGGTCGATGT ATGCGCTGAT GGGCATGGCG
CTCGGATACT TTAGCTATCG TTATAATTTG CCGCTCACCA TCCGCTCGGC GCTGTACCCG
ATCTTCGGTA AACGGATTAA CGGGCCGATA GGTCACTCAG TGGATATTGC AGCGGTGATC
GGCACTATCT TCGGTATTGC CACTACGCTC GGTATCGGTG TGGTGCAGCT TAACTATGGC
TTGAGCGTAC TGTTTGATAT TCCCGATTCG ATGGCGGCAA AAGCGGCACT GATCGCCTTG
TCGGTGATAA TCGCCACTAT CTCGGTGACA TCCGGTGTCG ATAAGGGCAT TCGCGTGTTA
TCGGAGCTTA ATGTCGCGCT GGCGCTGGGA TTGATCCTGT TCGTGTTGTT TATGGGCGAC
ACTTCGTTCC TGCTTAATGC ACTGGTGCTG AATGTTGGCG ACTATGTGAA TCGCTTTATG
GGCATGACGC TCAACAGTTT TGCCTTCGAC CGCCCGGTTG AGTGGATGAA TAACTGGACG
CTCTTCTTCT GGGCATGGTG GGTGGCATGG TCGCCGTTTG TCGGCTTGTT CCTGGCGCGT
ATCTCGCGTG GGCGTACCAT TCGCCAGTTC GTGCTGGGCA CGTTGATTAT TCCGTTTACC
TTCACGCTGT TATGGCTCTC GGTGTTCGGC AATAGCGCGC TGTATGAAAT CATCCACGGC
GGCGCGGCAT TTGCCGAGGA AGCGATGGTC CATCCGGAGC GCGGCTTCTA CAGCCTGCTG
GCGCAGTATC CGGCGTTTAC CTTTAGCGCC TCCGTCGCCA CCATTACTGG CCTGCTGTTT
TATGTGACCT CGGCGGACTC CGGGGCGCTG GTGCTGGGGA ATTTCACCTC GCAGCTTAAA
GATATCAACA GCGACGCCCC CGGCTGGCTG CGCGTCTTCT GGTCGGTGGC GATTGGCCTG
CTGACGCTCG GCATGCTGAT GACCAACGGG ATATCCGCGC TGCAAAACAC CACGGTGATT
ATGGGGCTGC CGTTCAGCTT TGTGATCTTC TTCGTGATGG CGGGGTTGTA TAAATCTCTG
AAGGTAGAAG ATTACCGCCG TGAAAGTGCC AACCGCGATA CCGCACCGCG ACCGCTGGGG
CTTCAGGATC GCCTGAGCTG GAAAAAACGT CTCTCGCGCC TGATGAATTA TCCGGGCACG
CGTTACACTA AACAGATGAT GGAGACGGTC TGTTACCCGG CAATGGAAGA AGTGGCGCAG
GAGTTGCGGT TGCGCGGCGC GTACGTGGAG CTAAAAAGCC TGCCGCCGGA AGAGGGACAA
CAGTTGGGGC ATCTGGATTT GTTGGTGCAT ATGGGCGAAG AGCAAAACTT TGTCTATCAG
ATTTGGCCGC AGCAATACTC GGTGCCGGGC TTTACCTACC GCGCACGTAG CGGGAAATCG
ACCTACTACC GGCTGGAAAC CTTCCTGTTA GAAGGCAGCC AGGGCAACGA CCTGATGGAC
TACAGCAAAG AGCAGGTGAT CACCGATATT CTTGACCAGT ACGAGCGGCA CCTTAACTTT
ATTCATCTCC ATCGTGAAGC GCCGGGCCAT AGCGTGATGT TCCCGGACGC GTGA
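
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the total sequence length, and that the spaces and line breaks in the listing are layout only. The helper name `gc_content` is hypothetical, and how the database rounds the value to a whole percentage is not stated in the record.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Toy input: 2 of 4 bases are G or C.
    print(gc_content("AT GC"))  # 0.5
```

Pasting the full gene sequence into `gc_content` and multiplying by 100 gives a percentage that can be compared with the GC content field.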
 
Protein sequence
MTDLSHSREK DKINPVVFYT SAGLILLFSL TTILFRDFSA LWIGRTLDWV SKTFGWYYLL 
AATLYIVFVV CIACSRFGSV KLGPEQSKPE FSLLSWAAML FAAGIGIDLM FFSVAEPVTQ
YMQPPEGAGQ TIEAARQAMV WTLFHYGLTG WSMYALMGMA LGYFSYRYNL PLTIRSALYP
IFGKRINGPI GHSVDIAAVI GTIFGIATTL GIGVVQLNYG LSVLFDIPDS MAAKAALIAL
SVIIATISVT SGVDKGIRVL SELNVALALG LILFVLFMGD TSFLLNALVL NVGDYVNRFM
GMTLNSFAFD RPVEWMNNWT LFFWAWWVAW SPFVGLFLAR ISRGRTIRQF VLGTLIIPFT
FTLLWLSVFG NSALYEIIHG GAAFAEEAMV HPERGFYSLL AQYPAFTFSA SVATITGLLF
YVTSADSGAL VLGNFTSQLK DINSDAPGWL RVFWSVAIGL LTLGMLMTNG ISALQNTTVI
MGLPFSFVIF FVMAGLYKSL KVEDYRRESA NRDTAPRPLG LQDRLSWKKR LSRLMNYPGT
RYTKQMMETV CYPAMEEVAQ ELRLRGAYVE LKSLPPEEGQ QLGHLDLLVH MGEEQNFVYQ
IWPQQYSVPG FTYRARSGKS TYYRLETFLL EGSQGNDLMD YSKEQVITDI LDQYERHLNF
IHLHREAPGH SVMFPDA
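
The protein sequence follows from the gene sequence under translation table 11. The sketch below is one way to reproduce it, not necessarily how the database derived it. It relies on the fact that table 11 assigns the same amino acid to each codon as the standard code and differs only in the permitted start codons. Because this gene begins with ATG, no special handling of alternative start codons is included, which is a simplification. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments of the standard code (shared by table 11),
# listed in the conventional TCAG order with the third base varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGACAGACC TTTCACACAG CAGGGAAAAG"))  # MTDLSHSREK
```

The output matches the first ten residues of the protein sequence. Passing the complete gene sequence should yield the full 677-residue protein.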