Gene EcE24377A_0331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_0331 
SymbolbetT 
ID5587428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp356894 
End bp358927 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content55% 
IMG OID640924056 
Productcholine transport protein BetT 
Protein accessionYP_001461484 
Protein GI157158744 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACC TTTCACACAG CAGGGAAAAG GACAAAATCA ATCCGGTGGT TTTTTACACC 
TCCGCCGGAC TGATTTTGTT GTTTTCCCTG ACAACGATCC TGTTTCGCGA CTTCTCGGCC
CTGTGGATTG GCCGCACGCT GGACTGGGTG TCTAAAACCT TCGGTTGGTA CTATCTGCTG
GCGGCAACGC TCTATATTGT CTTTGTGGTC TGTATCGCTT GTTCGCGTTT TGGTTCGGTG
AAGCTCGGGC CAGAACAATC CAAACCGGAA TTCAGCCTGC TGAGTTGGGC GGCGATGCTG
TTTGCTGCCG GGATCGGTAT CGACCTGATG TTCTTTTCCG TAGCCGAACC GGTAACGCAG
TATATGCAGC CGCCGGAAGG CGCGGGACAG ACGATTGAGG CCGCGCGTCA GGCGATGGTC
TGGACGCTGT TTCACTACGG CTTAACCGGC TGGTCGATGT ATGCGCTGAT GGGCATGGCG
CTCGGATACT TTAGCTATCG TTATAATTTG CCGCTCACCA TCCGCTCGGC GCTGTACCCG
ATCTTCGGTA AACGGATTAA CGGGCCGATA GGGCACTCAG TGGATATTGC AGCGGTGATC
GGCACCATCT TCGGTATTGC CACTACGCTC GGTATCGGTG TGGTGCAGCT TAACTATGGC
TTGAGCGTAC TGTTTGATAT TCCCGATTCG ATGGCGGCAA AAGCGGCACT GATCGCCTTG
TCGGTGATAA TCGCCACGAT CTCTGTCACC TCCGGTGTCG ATAAGGGGAT TCGTGTGTTA
TCGGAACTTA ACGTTGCGCT GGCGCTGGGA TTGATCCTGT TCGTATTGTT TATGGGCGAC
ACCTCGTTCC TGCTTAATGC GCTGGTGCTG AATGTTGGCG ACTATGTGAA TCGCTTTATG
GGCATGACGC TCAACAGTTT TGCCTTCGAC CGTCCAGTTG AGTGGATGAA TAACTGGACG
CTCTTCTTCT GGGCATGGTG GGTGGCATGG TCGCCGTTTG TCGGCTTGTT CCTGGCGCGT
ATCTCGCGTG GGCGTACCAT TCGCCAGTTC GTGCTGGGCA CGTTGATTAT TCCGTTTACC
TTCACGCTGT TATGGCTCTC GGTATTCGGC AATAGCGCGC TGTATGAAAT CATCCACGGC
GGCGCGGCAT TTGCCGAGGA AGCGATGGTC CATCCGGAGC GCGGCTTCTA CAGCCTGCTG
GCGCAGTATC CGGCGTTTAC CTTTAGCGCC TCCGTCGCCA CCATTACTGG CCTGCTGTTT
TATGTGACCT CGGCGGACTC CGGGGCGCTG GTACTGGGGA ATTTCACCTC GCAGCTTAAA
GATATCAACA GCGACGCCCC CGGCTGGCTG CGCGTCTTCT GGTCGGTGGC GATTGGCCTG
CTGACGCTCG GCATGCTGAT GACCAACGGG ATATCCGCGC TGCAAAACAC CACGGTAATC
ATGGGGCTGC CGTTCAGCTT TGTGATTTTC TTCGTGATGG CGGGGTTGTA TAAATCTCTG
AAGGTAGAAG ATTACCGCCG TGAAAGTGCC AACCGCGATA CCGCGCCGCG ACCGCTGGGG
CTTCAGGATC GCCTGAGCTG GAAAAAACGT CTCTCGCGCC TGATGAATTA TCCGGGCACG
CGTTACACTA AACAGATGAT GGAGACGGTC TGTTACCCGG CAATGGAAGA AGTGGCGCAG
GAGCTGCGGT TGCGCGGCGC GTACGTGGAG CTAAAAAGCC TGCCGCCGGA AGAGGGACAA
CAGTTGGGGC ATCTGGATTT GTTGGTGCAT ATGGGCGAAG AACAAAACTT TGTCTATCAG
ATTTGGCCGC AGCAATATTC GGTGCCGGGC TTTACCTACC GCGCACGTAG CGGTAAATCG
ACCTACTACC GGCTGGAAAC CTTCCTGTTA GAAGGCAGCC AGGGCAACGA CCTGATGGAC
TACAGCAAAG AGCAGGTGAT CACCGATATT CTTGACCAGT ACGAGCGGCA CCTTAACTTT
ATTCATCTCC ATCGTGAAGC GCCGGGCCAT AGCGTGATGT TCCCGGACGC GTGA
 
Protein sequence
MTDLSHSREK DKINPVVFYT SAGLILLFSL TTILFRDFSA LWIGRTLDWV SKTFGWYYLL 
AATLYIVFVV CIACSRFGSV KLGPEQSKPE FSLLSWAAML FAAGIGIDLM FFSVAEPVTQ
YMQPPEGAGQ TIEAARQAMV WTLFHYGLTG WSMYALMGMA LGYFSYRYNL PLTIRSALYP
IFGKRINGPI GHSVDIAAVI GTIFGIATTL GIGVVQLNYG LSVLFDIPDS MAAKAALIAL
SVIIATISVT SGVDKGIRVL SELNVALALG LILFVLFMGD TSFLLNALVL NVGDYVNRFM
GMTLNSFAFD RPVEWMNNWT LFFWAWWVAW SPFVGLFLAR ISRGRTIRQF VLGTLIIPFT
FTLLWLSVFG NSALYEIIHG GAAFAEEAMV HPERGFYSLL AQYPAFTFSA SVATITGLLF
YVTSADSGAL VLGNFTSQLK DINSDAPGWL RVFWSVAIGL LTLGMLMTNG ISALQNTTVI
MGLPFSFVIF FVMAGLYKSL KVEDYRRESA NRDTAPRPLG LQDRLSWKKR LSRLMNYPGT
RYTKQMMETV CYPAMEEVAQ ELRLRGAYVE LKSLPPEEGQ QLGHLDLLVH MGEEQNFVYQ
IWPQQYSVPG FTYRARSGKS TYYRLETFLL EGSQGNDLMD YSKEQVITDI LDQYERHLNF
IHLHREAPGH SVMFPDA