Gene ECH74115_3823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3823 
SymbolkgtP 
ID6968743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3548762 
End bp3550141 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content48% 
IMG OID643387609 
Productalpha-ketoglutarate transporter 
Protein accessionYP_002272062 
Protein GI209398967 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00883] metabolite-proton symporter 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.193326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATAACA TTTTCTTTGC CAAAAGAAAT AAACAAAAGC GACCGACAAA AGCATCGGAT 
TACGGCAGGA GACATAATGG CATGGCTGAA AGTACTGTAA CGGCAGACAG CAAACTGACA
AGTAGTGATA CTCGTCGCCG CATTTGGGCG ATTGTGGGGG CCTCTTCAGG TAATCTGGTC
GAGTGGTTCG ATTTCTATGT CTACTCGTTC TGTTCACTCT ACTTTGCCCA CATCTTCTTC
CCTTCCGGGA ACACGACGAC TCAACTACTA CAAACAGCAG GTGTTTTTGC TGCGGGATTC
CTGATGCGCC CAATAGGCGG TTGGCTATTT GGCCGCATAG CCGATAAACA TGGTCGCAAA
AAATCGATGC TGCTATCGGT GTGTATGATG TGTTTCGGCT CACTGGTAAT CGCCTGTTTG
CCTGGTTATG AAACAATCGG TACGTGGGCT CCGGCATTAT TGCTTCTCGC GCGTTTATTT
CAGGGATTAT CCGTTGGCGG AGAATATGGC ACCAGCGCCA CCTATATGAG TGAAGTTGCC
GTTGAAGGGC GCAAAGGTTT TTACGCATCA TTTCAGTATG TGACGTTGAT CGGCGGACAA
CTGCTAGCCC TACTGGTTGT CGTGGTTTTA CAACACACCA TGGAAGACTC TGCACTCAGA
GAGTGGGGAT GGCGTATTCC TTTCGCGTTA GGAGCTGTGT TAGCTGTTGT GGCGTTGTGG
TTACGTCGTC AGTTAGATGA AACTTCGCAA CAAGAAACGC GCGCTTTAAA AGAAGCTGGA
TCTCTGAAAG GATTATGGCG CAATCGCCGT GCATTCATCA TGGTTCTCGG TTTTACCGCT
GCGGGCTCCC TTTGTTTCTA TACCTTCACC ACTTATATGC AGAAGTATCT GGTAAATACT
GCGGGAATGC ATGCCAACGT GGCGAGTGGC ATTATGACTG CCGCATTGTT TGTATTCATG
CTTATTCAAC CACTCATTGG CGCGCTGTCG GATAAGATTG GTCGCCGTAC CTCAATGTTA
TGTTTCGGTT CGCTGGCAGC CATTTTTACC GTTCCTATTC TCTCAGCATT GCAAAACGTT
TCCTCGCCTT ATGCCGCTTT TGGTCTGGTG ATGTGTGCCC TGCTGATAGT GAGTTTTTAT
ACATCAATCA GTGGAATACT GAAGGCTGAG ATGTTCCCGG CACAGGTTCG CGCATTAGGC
GTTGGTCTGT CATATGCGGT CGCTAATGCT ATATTTGGTG GTTCGGCGGA GTACGTAGCG
TTGTCGCTGA AATCAATAGG AATGGAAACA GCCTTCTTCT GGTATGTGAC CTTGATGGCC
GTGGTGGCGT TTCTGGTTTC TTTGATGCTA CATCGCAAAG GGAAGGGGAT GCGTCTTTAG
 
Protein sequence
MYNIFFAKRN KQKRPTKASD YGRRHNGMAE STVTADSKLT SSDTRRRIWA IVGASSGNLV 
EWFDFYVYSF CSLYFAHIFF PSGNTTTQLL QTAGVFAAGF LMRPIGGWLF GRIADKHGRK
KSMLLSVCMM CFGSLVIACL PGYETIGTWA PALLLLARLF QGLSVGGEYG TSATYMSEVA
VEGRKGFYAS FQYVTLIGGQ LLALLVVVVL QHTMEDSALR EWGWRIPFAL GAVLAVVALW
LRRQLDETSQ QETRALKEAG SLKGLWRNRR AFIMVLGFTA AGSLCFYTFT TYMQKYLVNT
AGMHANVASG IMTAALFVFM LIQPLIGALS DKIGRRTSML CFGSLAAIFT VPILSALQNV
SSPYAAFGLV MCALLIVSFY TSISGILKAE MFPAQVRALG VGLSYAVANA IFGGSAEYVA
LSLKSIGMET AFFWYVTLMA VVAFLVSLML HRKGKGMRL