Gene ECH74115_0875 details

Gene Information

Locus tag: ECH74115_0875
Symbol:
ID: 6968892
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 890737
End bp: 892998
Gene Length: 2262 bp
Protein Length: 753 aa
Translation table: 11
GC content: 51%
IMG OID: 643384900
Product: hypothetical protein
Protein accession: YP_002269400
Protein GI: 209399369
COG category: [C] Energy production and conversion
COG ID: [COG1048] Aconitase A
TIGRFAM ID:
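
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(890737, 892998)
print(length, protein_length_aa(length))  # prints: 2262 753
```

Under those assumptions the computed values agree with the 2262 bp and 753 aa listed in the record.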


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCAAGT TATCTGAAAA AGGCGTGTTT CTCGCCAGTA ATAACGAAAT AATTGCCGAA 
GAACATTTCA CCGGCGAAAT TAAAAAAGAA GAAGCAAAAA AAGGCACTAT TGCCTGGTCT
ATTCTCTCTT CTCATAATAC GTCCGGAAAT ATGGATAAAC TTAAAATTAA GTTTGATTCA
TTAGCCTCTC ACGATATTAC CTTTGTTGGT ATTGTACAGA CCGCTAAAGC GTCCGGTATG
GAACGTTTCC CGCTGCCGTA TGTGCTGACC AACTGCCATA ACTCACTCTG CGCCGTCGGC
GGCACCATTA ACGGTGATGA CCATGTTTTT GGTTTATCGG CAGCTCAGCG TTATGGCGGT
ATTTTTGTGC CTCCGCATAT TGCGGTCATC CATCAATATA TGCGTGAGAT GATGGCAGGT
GGCGGCAAAA TGATCCTCGG GTCAGACAGT CACACCCGTT ACGGTGCATT AGGGACAATG
GCAGTCGGTG AGGGCGGCGG CGAGTTGGTA AAACAATTGC TTAATGACAC CTGGGATATC
GACTATCCGG GAGTTGTTGC GGTGCATCTG ACCGGAAAAC CAGCGCCGTA TGTGGGGCCG
CAGGATGTGG CGTTGGCTAT CATCGGTGCC GTGTTCAAAA ACGGCTACGT CAAAAACAAA
GTGATGGAAT TCGTAGGTCC CGGTGTTGCT GCGCTCTCTA CCGATTTCCG TAACAGCGTT
GACGTTATGA CCACTGAAAC GACCTGTTTA AGTTCTGTCT GGCAAACCGA TGAAGAAGTC
CATAACTGGC TGGCGCTGCA CGGTCGCGGC CAGGATTACT GCCAGCTTAA CCCTCAACCG
ATGGCGTACT ACGATGGCTG CATCAGCGTT GATTTAAGCG CCATCAAACC AATGATTGCG
CTGCCGTTCC ACCCGAGCAA CGTGTATGAA ATCGACACAC TGAACCAGAA CTTGACCGAC
ATTCTGCGTG AGATTGAAAT TGAGTCCGAA CGCGTGGCGC ACGGTAAAGC CAAACTCTCG
CTGCTGGACA AAGTGGAAAA TGGTCGCCTG AAAGTGCAGC AGGGGATTAT CGCGGGCTGT
TCTGGCGGTA ACTACGAAAA CGTCATCGCG GCGGCGAATG CACTGCGCGG TCAATCCTGT
GGCAATGACA CCTTCTCGCT GGCGGTTTAC CCGTCATCAC AGCCGGTGTT TATGGATCTC
GCCAAAAAAG GTGTGGTAGC AGATTTGATT GGCGCAGGCG CAATCATCAG AACCGCGTTC
TGCGGCCCAT GCTTTGGCGC GGGCGATACG CCAATCAACA ACGGTTTGAG TATTCGCCAC
ACCACGCGTA ACTTCCCGAA CCGCGAAGGC TCTAAGCCAG CTAATGGGCA GATGTCAGCG
GTGGCGTTGA TGGACGCTCG TTCTATCGCT GCGACTGCGG CAAACGGTGG CTATTTAACC
TCTGCCAGCG AACTCGATTG CTGGGACAAC GTGCCGGAGT ACGCCTTCGA TGTAACGCCG
TATAAAAATC GCGTCTATCA AGGCTTTGTG AAAGGGGCGG CCCAGCAACC GCTGATTTAC
GGACCGAACA TTAAAGACTG GCCGGAATTG GGTGCGCTGA CTGACAATAT CGTCCTCAAA
GTGTGCTCGA AGATCCTCGA CGAAGTGACC ACCACCGACG AATTGATTCC TTCCGGTGAA
ACCTCTTCTT ATCGTTCAAA TCCGATTGGT CTGGCGGAGT TTACCCTGTC ACGCCGCGAT
CCAGGTTATG TTGGCAGCAG TAAAGCGACT GCTGAGCTGG AAAATCAGCG TCTGGCGGGG
AATGTCAGCG AGCTGACAGA GGTGTTTGCG CGCATTAAGC AGATTGCTGG TCAGGAGCAT
ATTGATCCGC TGCAAACTGA AATTGGCAGC ATGGTCTATG CGGTGAAACC AGGCGATGGT
TCTGCGCGTG AACAGGCGGC GAGTTGTCAG CGTGTGATTG GCGGTCTGGC GAATATTGCC
GAGGAGTACG CGACTAAACG CTATCGTTCT AACGTCATCA ACTGGGGGAT GTTACCGCTG
CAGATGGTGG AAGCGCCAAC CTTTGAAGTG GGGGATTACA TTTACATCCC TGGCATTAAA
GCGGCGCTGG ATAATCCGGG TACGACGTTT AAAGGTTATG TGATCCATGA AGATGCGCCG
GTAACGGAAA TTACGCTCTA TATGGAAAGT CTGACTGCTG AAGAGCGCGA GATTATCAAG
GCGGGTAGTT TGATTAACTT CAATAAAAAC CGTCAGATGT AA
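
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any computation. Below is a minimal sketch, using hypothetical helper names, that strips the whitespace and computes a GC fraction. Applied to the full sequence it would be expected to land near the GC content field of the record, though that has not been checked here.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as it appears in the record.
first_line = "ATGATCAAGT TATCTGAAAA AGGCGTGTTT CTCGCCAGTA ATAACGAAAT AATTGCCGAA"
print(len(clean_sequence(first_line)))  # prints: 60
print(round(gc_fraction(first_line), 3))
```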
 
Protein sequence
MIKLSEKGVF LASNNEIIAE EHFTGEIKKE EAKKGTIAWS ILSSHNTSGN MDKLKIKFDS 
LASHDITFVG IVQTAKASGM ERFPLPYVLT NCHNSLCAVG GTINGDDHVF GLSAAQRYGG
IFVPPHIAVI HQYMREMMAG GGKMILGSDS HTRYGALGTM AVGEGGGELV KQLLNDTWDI
DYPGVVAVHL TGKPAPYVGP QDVALAIIGA VFKNGYVKNK VMEFVGPGVA ALSTDFRNSV
DVMTTETTCL SSVWQTDEEV HNWLALHGRG QDYCQLNPQP MAYYDGCISV DLSAIKPMIA
LPFHPSNVYE IDTLNQNLTD ILREIEIESE RVAHGKAKLS LLDKVENGRL KVQQGIIAGC
SGGNYENVIA AANALRGQSC GNDTFSLAVY PSSQPVFMDL AKKGVVADLI GAGAIIRTAF
CGPCFGAGDT PINNGLSIRH TTRNFPNREG SKPANGQMSA VALMDARSIA ATAANGGYLT
SASELDCWDN VPEYAFDVTP YKNRVYQGFV KGAAQQPLIY GPNIKDWPEL GALTDNIVLK
VCSKILDEVT TTDELIPSGE TSSYRSNPIG LAEFTLSRRD PGYVGSSKAT AELENQRLAG
NVSELTEVFA RIKQIAGQEH IDPLQTEIGS MVYAVKPGDG SAREQAASCQ RVIGGLANIA
EEYATKRYRS NVINWGMLPL QMVEAPTFEV GDYIYIPGIK AALDNPGTTF KGYVIHEDAP
VTEITLYMES LTAEEREIIK AGSLINFNKN RQM
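
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is a hand-rolled illustration, not the database's own method. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted), and it reads codons from the first base, stopping at the first stop codon. All names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA string up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGATCAAGT TATCTGAAAA AGGCGTGTTT"))  # prints: MIKLSEKGVF
print(translate("CGTCAGATGT AA"))                     # prints: RQM
```

Both fragments match the corresponding start and end of the listed protein sequence.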