Gene ECH74115_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0044 
SymbolcaiT 
ID6970666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp44824 
End bp46338 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content52% 
IMG OID643384125 
ProductL-carnitine/gamma-butyrobetaine antiporter 
Protein accessionYP_002268648 
Protein GI209399200 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAATG AAAAGAGAAA AACGGGAATA GAACCGAAGG TTTTCTTTCC GCCATTAATA 
ATCGTCGGCA TACTTTGTTG GCTTACAGTC AGAGATCTGG ATGCGGCGAA TGTCGTTATT
AATGCTGTAT TCAGTTACGT CACCAATGTA TGGGGATGGG CATTTGAATG GTATATGGTG
GTGATGCTTT TCGGTTGGTT CTGGCTGGTG TTTGGCCCGT ATGCCAAAAA GCGTTTAGGT
AACGAACCAC CTGAATTTAG TACCGCCAGT TGGATCTTTA TGATGTTCGC CTCCTGTACG
TCTGCTGCCG TACTGTTCTG GGGATCGATT GAGATCTACT ACTACATCTC CACCCCGCCG
TTTGGCTTAG AACCGAACTC GACAGGAGCG AAAGAGTTGG GGCTGGCTTA CAGCTTGTTC
CTCTGGGGAC CTCTGCCGTG GGCCACTTAC AGCTTCCTTT CAGTCGCCTT CGCTTACTTC
TTCTTTGTCC GCAAAATGGA AGTGATTCGC CCCAGCTCCA CCCTGGTGCC GCTGGTAGGT
GAAAAACACG CCAAAGGGTT GTTCGGCACT ATCGTCGACA ACTTCTATCT CGTCGCCTTG
ATCTTCGCGA TGGGTACCAG TCTGGGCCTT GCCACGCCGC TGGTGACCGA GTGTATGCAA
TGGTTGTTTG GCATTCCGCA TACCCTGCAA CTGGACGCTA TCATCATTAC CTGCTGGATT
ATCCTCAACG CCATTTGCGT CGCCTGCGGT CTGCAAAAAG GGGTACGTAT CGCCAGTGAC
GTGCGTAGTT ACCTGAGCTT CCTGATGCTG GGTTGGGTGT TCATCGTCAG CGGTGCCAGC
TTCATCATGA ACTACTTCAC CGATTCGGTG GGGATGTTGC TGATGTATCT GCCGCGCATG
TTGTTCTATA CCGATCCCAT CGCTAAAGGC GGCTTCCCGC AGGGCTGGAC CGTGTTCTAC
TGGGCATGGT GGGTGATTTA CGCCATCCAG ATGAGTATCT TCCTCGCCCG CATCTCCCGT
GGTCGTACTG TGCGTGAACT GTGCTTCGGC ATGGTGATGG GACTGACAGC ATCCACCTGG
ATCCTGTGGA CTGTACTCGG TAGTAACACT CTGCTGTTGA TAGATAAAAA CATCATCAAC
ATTCCAAATC TGATCGAACA GTACGGTGTG GCGCGCGCCA TCATCGAAAC CTGGGCCGCT
CTGCCGCTCA GCACCGCCAC CATGTGGGGC TTCTTCATCC TCTGCTTTAT TGCCACCGTT
ACGCTGGTTA ACGCCTGCTC TTATACCCTG GCGATGTCCA CTTGCCGCGA AGTACGCGAT
GGTGAAGAAC CACCGCTGCT GGTGCGTATC GGTTGGTCAA TTCTGGTTGG CATTATCGGT
ATTGTTTTGC TGGCGCTCGG CGGCCTGAAA CCGATTCAAA CTGCCATTAT CGCCGGAGGA
TGCCCGCTGT TCTTCGTCAA CATTATGGTG ACGCTCTCCT TTATTAAAGA CGCGAAACAG
AACTGGAAAG ATTAA
 
Protein sequence
MKNEKRKTGI EPKVFFPPLI IVGILCWLTV RDLDAANVVI NAVFSYVTNV WGWAFEWYMV 
VMLFGWFWLV FGPYAKKRLG NEPPEFSTAS WIFMMFASCT SAAVLFWGSI EIYYYISTPP
FGLEPNSTGA KELGLAYSLF LWGPLPWATY SFLSVAFAYF FFVRKMEVIR PSSTLVPLVG
EKHAKGLFGT IVDNFYLVAL IFAMGTSLGL ATPLVTECMQ WLFGIPHTLQ LDAIIITCWI
ILNAICVACG LQKGVRIASD VRSYLSFLML GWVFIVSGAS FIMNYFTDSV GMLLMYLPRM
LFYTDPIAKG GFPQGWTVFY WAWWVIYAIQ MSIFLARISR GRTVRELCFG MVMGLTASTW
ILWTVLGSNT LLLIDKNIIN IPNLIEQYGV ARAIIETWAA LPLSTATMWG FFILCFIATV
TLVNACSYTL AMSTCREVRD GEEPPLLVRI GWSILVGIIG IVLLALGGLK PIQTAIIAGG
CPLFFVNIMV TLSFIKDAKQ NWKD