Gene ECH74115_4165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4165 
Symbol 
ID6971302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3855162 
End bp3856787 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content52% 
IMG OID643387911 
Productputative xanthine dehydrogenase accessory factor 
Protein accessionYP_002272350 
Protein GI209400720 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1975] Xanthine and CO dehydrogenases maturation factor, XdhC/CoxF family 
TIGRFAM ID[TIGR03309] selenium-dependent molybdenum hydroxylase system protein, YqeB family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones62 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTT TCACAGAGGC TGCAAAACTC GAAGAGCAAA ATTGTCCGTT TGCTATGGCG 
CAAATTATTG ATAGCCGAGG CTCGACTCCC CGCCATTCTG CACAAATGTT AGTGCGCGCC
GATGGCTCTA TCGTCGGTAC AATTGGTGGC GGAATGGTTG AGCGGAAGGT GATTGAAGAG
TCGCTTCAGG CATTGCAGGA ACGTAAGCCG CGATTATTCC ATGGACGTAT GGCTCGTAAC
GGTGCGGATG CTGTCGGGTC AGATTGTGGA GGTGCGATGT CAGTGTTTAT CAGCGTCCAT
GGTATGCGTC CACGTCTGGT GTTGATTGGC GCGGGGCATG TCAACCGGGC GATAGCCCAG
AGTGCGGCGC TATTAGGATT TGATATTGCC GTTGCCGATA TTTATCGCGA AAGCCTCAAT
CCTGAACTAT TCCCTCCATC AACCACGCTT CTCCATGCTG AGTCGTTTGG TGCGGCAGTG
GAAGCACTGG ATATTCGCCC TGATAATTTT GTCCTGATTG CCACGAATAA TCAGGATCGT
GAAGCCCTCG ACAAACTCAT TGAACAGCCC ATTGCATGGT TGGGGTTGCT GGCAAGTCGT
CGCAAGGTTC AGCTTTTCCT GCGTCAATTG CGTGAGAAAG GCGTGGCTGA AGAACATATT
GCCCGTTTAC ATGCGCCCGT TGGTTACAAC ATAGGTGCGG AAACGCCGCA GGAGATCGCC
ATCAGCGTGC TGGCAGAAAT ATTACAGGTG AAAAATAACG CGCCGGGTGG GCTGATGATG
AAACCTTCTC ATCCTTCCGG ACACCAGCTG GTGGTGATTC GCGGTGCGGG AGATATCGCC
AGTGGTGTGG CGCTACGTCT GTATCATGCG GGTTTTAAAG TGATCATGCT GGAAGTGGAA
AAACCGACAG TGATTCGTTG TACCGTGGCG TTTGCCCAGG CCGTGTTCGA TGGCGAAATG
ACGGTCGAAG GCGTCACTGC TCGCCTGGCA ACCAGCTCTG CGGAAGCGAT GAAACTTACC
GAACGCGGAT TCATCCCTGT GATGGTAGAT CCCGCCTGTT CATTGCTTGA TGAACTGAAA
CCGCTTTGCG TGGTGGACGC TATTCTGGCG AAACAGAATT TGGGAACACG GGCAGATATG
GCACCAGTAA CAATCGCGCT TGGGCCGGGC TTTGCTGCAG GGAAGGATTG TCATGCGGTA
ATTGAAACAA ATCGCGGGCA CTGGCTCGGT CAGGTGATTT ACTGTGGTTG TGCGCAGGAG
AATACCGGTG TTCCTGGCAA TATTATGGGG CATACCACCC GACGGGTGAT CCGTGCTCCT
GCTGCAGGCA TTATGCGATC CAACGTGAAA TTAGGCGATC TGGTGAAAGA GGGCGATGTG
ATTGCCTGGA TTGGTGAGCA TGAAATTAAA GCACCGTTGA CGGGGATGGT GCGTGGCTTG
TTGAACGACG GCCTGGCAGT GGTTGGTGGT TTTAAAATTG GTGATATCGA TCCTCGTGGT
GAAACCGCTG ATTTCACCAG CGTTTCTGAT AAAGCCCGGG CGATTGGCGG CGGCGTACTT
GAGGCGTTAA TGATGTTGAT GCATCAGGGC GTGAAAGCGA CAAAAGAAGT GCTGGAAGTG
GCTTAA
 
Protein sequence
MNIFTEAAKL EEQNCPFAMA QIIDSRGSTP RHSAQMLVRA DGSIVGTIGG GMVERKVIEE 
SLQALQERKP RLFHGRMARN GADAVGSDCG GAMSVFISVH GMRPRLVLIG AGHVNRAIAQ
SAALLGFDIA VADIYRESLN PELFPPSTTL LHAESFGAAV EALDIRPDNF VLIATNNQDR
EALDKLIEQP IAWLGLLASR RKVQLFLRQL REKGVAEEHI ARLHAPVGYN IGAETPQEIA
ISVLAEILQV KNNAPGGLMM KPSHPSGHQL VVIRGAGDIA SGVALRLYHA GFKVIMLEVE
KPTVIRCTVA FAQAVFDGEM TVEGVTARLA TSSAEAMKLT ERGFIPVMVD PACSLLDELK
PLCVVDAILA KQNLGTRADM APVTIALGPG FAAGKDCHAV IETNRGHWLG QVIYCGCAQE
NTGVPGNIMG HTTRRVIRAP AAGIMRSNVK LGDLVKEGDV IAWIGEHEIK APLTGMVRGL
LNDGLAVVGG FKIGDIDPRG ETADFTSVSD KARAIGGGVL EALMMLMHQG VKATKEVLEV
A