Gene EcHS_A0373 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0373 
SymbolbetT 
ID5591500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp386803 
End bp388836 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content55% 
IMG OID640919558 
Productcholine transport protein BetT 
Protein accessionYP_001457144 
Protein GI157159826 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value0.834443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACC TTTCACACAG CAGGGAAAAG GACAAAATCA ATCCGGTGGT GTTTTACACC 
TCCGCCGGAC TGATTTTGTT GTTTTCCCTG ACAACGATCC TGTTTCGCGA CTTCTCGGCC
CTGTGGATTG GCCGCACGCT GGACTGGGTT TCTAAAACCT TCGGTTGGTA CTATCTACTG
GCGGCAACGC TCTATATTGT CTTTGTGGTC TGTATCGCTT GTTCGCGTTT TGGTTCGGTG
AAGCTCGGGC CAGAACAATC CAAACCGGAA TTCAGCCTGC TGAGTTGGGC GGCGATGCTG
TTTGCTGCCG GGATCGGTAT CGACCTGATG TTCTTCTCCG TAGCCGAACC GGTAACGCAG
TATATGCAGC CGCCGGAAGG CGCGGGACAG ACGATTGAGG CCGCGCGTCA GGCGATGGTC
TGGACGCTGT TTCACTACGG CTTAACCGGC TGGTCGATGT ATGCGCTGAT GGGCATGGCG
CTCGGATACT TTAGCTATCG TTATAATTTG CCGCTCACCA TCCGCTCGGC GCTGTACCCG
ATCTTCGGTA AACGGATTAA CGGGCCGATA GGGCACTCAG TGGATATTGC AGCGGTGATC
GGCACCATCT TCGGTATTGC CACTACGCTC GGTATCGGTG TGGTGCAGCT TAACTATGGC
TTGAGCGTAC TGTTTGATAT TCCCGATTCG ATGGCGGCAA AAGCGGCACT GATCGCCTTG
TCGGTGATAA TCGCCACGAT CTCTGTCACC TCCGGTGTCG ATAAGGGGAT TCGTGTGTTA
TCGGAACTTA ACGTTGCGCT GGCGCTGGGA TTGATCCTGT TCGTATTGTT TATGGGCGAC
ACCTCGTTCC TGCTTAATGC GCTGGTGCTG AATGTTGGCG ACTATGTGAA TCGCTTTATG
GGCATGACGC TCAACAGTTT TGCCTTCGAC CGTCCAGTTG AGTGGATGAA TAACTGGACG
CTCTTCTTCT GGGCATGGTG GGTGGCATGG TCGCCGTTTG TCGGCTTGTT CCTGGCGCGT
ATCTCGCGTG GGCGTACCAT TCGCCAGTTC GTGCTGGGCA CGTTGATTAT TCCGTTTACC
TTCACGCTGT TATGGCTCTC GGTGTTCGGC AATAGCGCGC TGTATGAAAT CATCCACGGC
GGCGCGGCAT TTGCCGAGGA AGCGATGGTC CATCCGGAGC GCGGCTTCTA CAGCCTGCTG
GCGCAGTATC CGGCGTTTAC CTTTAGCGCC TCCGTCGCCA CCATTACTGG CCTGCTGTTT
TATGTGACCT CGGCGGACTC CGGTGCGCTG GTGCTGGGGA ATTTCACCTC GCAGCTTAAA
GATATCAACA GCGACGCCCC CGGCTGGCTG CGCGTCTTCT GGTCGGTGGC GATTGGCCTG
CTGACGCTCG GCATGCTGAT GACCAACGGG ATATCCGCGC TGCAAAACAC CACGGTGATT
ATGGGGCTGC CGTTCAGCTT TGTGATCTTC TTCGTGATGG CGGGGTTGTA TAAATCTCTG
AAGGTAGAAG ATTACCGCCG TGAAAGTGCC AACCGCGATA CCGCACCGCG ACCGCTGGGG
CTTCAGGATC GCCTGAGCTG GAAAAAACGT CTCTCGCGCC TGATGAATTA TCCGGGCACG
CGTTACACTA AACAGATGAT GGAGACGGTC TGTTACCCGG CAATGGAAGA AGTGGCGCAG
GAGCTGCGGT TGCGCGGCGC ATACGTGGAG CTAAAAAGCC TGCCGCCGGA AGAGGGACAA
CAGTTGGGGC ATCTGGATTT GTTGGTGCAT ATGGGCGAAG AACAAAACTT TGTCTATCAG
ATTTGGCCGC AGCAATATTC GGTGCCGGGC TTTACCTACC GCGCACGTAG CGGTAAATCG
ACCTACTACC GGCTGGAAAC CTTCCTGTTA GAAGGCAGCC AGGGCAACGA CCTGATGGAC
TACAGCAAAG AGCAGGTGAT CACCGATATT CTTGACCAGT ACGAGCGGCA CCTTAACTTT
ATTCATCTCC ATCGTGAAGC GCCGGGCCAT AGCGTGATGT TCCCGGACGC GTGA
 
Protein sequence
MTDLSHSREK DKINPVVFYT SAGLILLFSL TTILFRDFSA LWIGRTLDWV SKTFGWYYLL 
AATLYIVFVV CIACSRFGSV KLGPEQSKPE FSLLSWAAML FAAGIGIDLM FFSVAEPVTQ
YMQPPEGAGQ TIEAARQAMV WTLFHYGLTG WSMYALMGMA LGYFSYRYNL PLTIRSALYP
IFGKRINGPI GHSVDIAAVI GTIFGIATTL GIGVVQLNYG LSVLFDIPDS MAAKAALIAL
SVIIATISVT SGVDKGIRVL SELNVALALG LILFVLFMGD TSFLLNALVL NVGDYVNRFM
GMTLNSFAFD RPVEWMNNWT LFFWAWWVAW SPFVGLFLAR ISRGRTIRQF VLGTLIIPFT
FTLLWLSVFG NSALYEIIHG GAAFAEEAMV HPERGFYSLL AQYPAFTFSA SVATITGLLF
YVTSADSGAL VLGNFTSQLK DINSDAPGWL RVFWSVAIGL LTLGMLMTNG ISALQNTTVI
MGLPFSFVIF FVMAGLYKSL KVEDYRRESA NRDTAPRPLG LQDRLSWKKR LSRLMNYPGT
RYTKQMMETV CYPAMEEVAQ ELRLRGAYVE LKSLPPEEGQ QLGHLDLLVH MGEEQNFVYQ
IWPQQYSVPG FTYRARSGKS TYYRLETFLL EGSQGNDLMD YSKEQVITDI LDQYERHLNF
IHLHREAPGH SVMFPDA