Gene ECH74115_0331 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0331 
Symbol 
ID6966845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp337219 
End bp338175 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content61% 
IMG OID643384392 
ProductFAD binding domain in molybdopterin dehydrogenase 
Protein accessionYP_002268907 
Protein GI209398590 
COG category[C] Energy production and conversion 
COG ID[COG1319] Aerobic-type carbon monoxide dehydrogenase, middle subunit CoxM/CutM homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCGT TTACCTATGA ACGAGTGAAC ACCCCCGCCG AGGCGGCACT TAGCGCTCAG 
CGCGTACCCG GCGCAAAATT TATCGCGGGC GGGACCAATC TGCTGGACCT GATGAAGCTG
GAAATTGAAA CGCCCACCCA CCTTATCGAT GTGAACGGGC TCGGGCTCGA TAAGATCGAA
GTGACCGACG CGGGTGGGCT GCGCATCGGC GCACTGGTAC GGAACACCGA CCTGGTGGCT
CACGAGCGCG TGCGTCGTGA TTACGCGGTA CTCTCCCGCG CCCTGCTCGC TGGCGCGTCT
GGTCAGTTAC GCAATCAGGC AACCACAGCA GGTAATCTGC TCCAGCGCAC GCGCTGCCCC
TATTTTTACG ACACCAATCA GCCCTGCAAT AAGCGCCTGC CCGGGAGCGG CTGCGCGGCG
CTTGAAGGCT TTAGCCGTCA GCACGCGGTG GTAGGCGTAA GCGAAGCCTG CATTGCCACC
CATCCGAGCG ATATGGCGGT CGCAATGCGG TTGCTGGATG CGGTGGTGGA AACCATCACG
CCGGAGGGAA AGACTCGCAG TATCACACTG GCTGATTTTT ATCACCCACC GGGGAAAACG
CCGCACATTG AAACCGCCCT GCTTCCCGGT GAGCTTATCG TTGCGGTGAC GTTACCTCCG
CCGCTCGGCG GAAAACATAT CTACCGTAAG GTGCGCGATC GCGCCTCCTA CGCCTTTGCC
CTGGTATCGG TCGCGGCGAT TATTCAGCCT GACGGCAGCG GGCGCGTCGC GCTGGGCGGA
GTAGCACATA AGCCCTGGCG CATTGAGGCT GCGGATGCTC AGCTATCCCA GGGGGCGCAG
GCCGTATATG ACGCGCTGTT CGCCAGCGCC CATCCCACCG CTGAAAACAC CTTTAAACTC
CTGTTGGCGA AGCGAACGCT TGCCTCCGTA CTGGCTGAAG CGAGGGCACA AGCATGA
 
Protein sequence
MKAFTYERVN TPAEAALSAQ RVPGAKFIAG GTNLLDLMKL EIETPTHLID VNGLGLDKIE 
VTDAGGLRIG ALVRNTDLVA HERVRRDYAV LSRALLAGAS GQLRNQATTA GNLLQRTRCP
YFYDTNQPCN KRLPGSGCAA LEGFSRQHAV VGVSEACIAT HPSDMAVAMR LLDAVVETIT
PEGKTRSITL ADFYHPPGKT PHIETALLPG ELIVAVTLPP PLGGKHIYRK VRDRASYAFA
LVSVAAIIQP DGSGRVALGG VAHKPWRIEA ADAQLSQGAQ AVYDALFASA HPTAENTFKL
LLAKRTLASV LAEARAQA