Gene ECH74115_4156 details


Gene Information

Locus tag: ECH74115_4156
Symbol: xdhA
ID: 6970033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3843034
End bp: 3845331
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 53%
IMG OID: 643387902
Product: xanthine dehydrogenase subunit XdhA
Protein accession: YP_002272341
Protein GI: 209396943
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
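
The start, end, gene length and protein length fields are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3843034
end_bp = 3845331
protein_length_aa = 765

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
expected_protein_length = codons - 1

print(gene_length_bp)           # 2298
print(remainder)                # 0
print(expected_protein_length)  # 765
```

Under these assumptions the computed values agree with the reported Gene Length (2298 bp) and Protein Length (765 aa).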


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.0988466
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCGC GGGAAGCAAC CGCTACGGGT GAATCATGCA TGCGCGTTGA TGCCATTGCT 
AAGGTCACCG GGCGGGCACG ATATACTGAC GATTATGTTA TGGCGGGCAT GTGTTATGCG
AAATATGTAC GTAGCCCTAT CGCACATGGT TATGCCGTAA GTATTAATGA TGAACAAGCC
AGAAGTTTAC CAGGCGTACT GGCGATTTTT ACCTGGGAAG ATGTGCCTGA TATCCCATTC
GCTACAGCTG GGCATGCATG GACACTTGAC GAAAACAAGC GCGATACCGC CGATCGCGCA
CTGCTAACTC GCCATGTTCG TCATCATGGT GACGCCGTTG CCATCGTCGT GGCCCGCGAT
GAACTCACGG CAGAAAAAGC GGCGCAATTG GTCAGCATTG AGTGGGAAGA ATTACCCGTT
ATCACCACGC CAGAAGCGGC GCTGGCAGAG GACGCTGCAC CAATCCATAA CGGCGGCAAT
TTACTGAAAC AAAGCACGAT GTCGACGGGC AATGTCCAAC AAACAATCGA TGCCGCCGAC
TACCAGGTAC AGGGGCACTA TCAGACCCCC GTTATTCAAC ATTGTCACAT GGAAAGCGTA
ACATCGCTGG CGTGGATGGA GGATGACTCG CGAATTACCA TCGTTTCCAG CACCCAGATC
CCGCACATTG TTCGCCGCGT GGTTGGTCAG GCGCTGGATA TTCCCTGGTC ATGCGTACGA
GTCATCAAAC CATTTGTCGG TGGCGGTTTT GGTAATAAAC AGGATGTACT GGAAGAGCCA
ATGGCGGCAT TCCTGACCAG CAAGCTTGGC GGCATTCCGG TGAAAGTTTC CCTTAGCCGT
GAAGAGTGTT TCCTCGCAAC CCGTACCCGC CACGCTTTTA CTATTGACGG GCAAATGGGC
GTGAACCGCG ACGGAACATT GAAAGGTTAT AGTCTGGATG TTCTGTCTAA CACCGGCGCT
TATGTATCTC ACGGGCACTC CATCGCTTCT GCAGGGGGAA ATAAAGTCGC TTACCTTTAT
CCTCGTTGTG CCTACGCTTA CAGTTCAAAG ACCTGCTATA CCAACCTCCC CTCGGCTGGT
GCGATGCGTG GTTATGGCGC GCCACAAGTC GTATTTGCCG TTGAGTCTAT GCTTGATGAC
GCCGCGACAG CGTTAGGTAT TGATCCTGTT GAAATTCGTT TACGCAACGC CTCACGCGAA
GGAGATGCTA ATCCGCTCAC GGGCAAACGT ATTTACAGCG CAGGGTTGCC GGAGTGTCTT
GAAAAAGGCC GGAAAATCTT TGAATGGGAA AAACGCCGTG CAGAGTGCCA GAACCAGCAA
GGCAATTTAC GCCGCGGCGT TGGCGTCGCC TGTTTTAGCT ACACCTCTAA CACCTGGCCT
GTCGGCGTAG AAATAGCAGG CGCGCGCCTT CTGATGAATC AGGATGGAAC CATCAACGTG
CAAAGCGGCG CGACGGAAAT CGGTCAGGGT GCCGACACCG TCTTCTCGCA AATGGTGGCA
GAAACCGTGG GGGTTCCGGT CAGCGACGTT CGCGTTATTT CAACACAAGA TACCGACGTT
ACGCCGTTCG ATCCCGGCGC ATTTGCCTCA CGCCAGAGCT ATGTTGCCGC GCCTGCGCTG
CGCAGTGCGG CACTGTTATT AAAAGAGAAA ATCATCGCTC ACGCCGCAGT CATGCTACAT
CAGTCAGCGA TGAATCTGAC CCTGATAAAA GGCCATATCG TGCTGGTTGA ACGACCGGAA
GAGCCGTTAA TGTCGTTAAA AGATTTGGCG ATGGACGCTT TCTACCACCC TGAACGCGGC
GGGCAGCTCT CTGCTGAAAG CTCCATCAAA ACCACCACTA ACCCACCGGC GTTCGGCTGT
ACCTTTGTTG ATCTGACGGT CGATATTGCG CTGTGCAAAG TCACCATCAA CCGCATCCTC
AACGTTCATG ATTCGGGCCA TATTCTTAAT CCGCTACTGG CAGAAGGTCA GGTACACGGC
GGAATGGGAA TGGGCATTGG CTGGGCGCTA TTTGAAGAGA TGATCATCGA TGCGAAAAGC
GGCGTGGTCC GTAACCCCAA TCTGCTGGAT TACAAAATGC CGACCATGCC GGATCTGCCA
CAACTGGAAA GCGCGTTCGT CGAAATCAAT GAGCCGCAAT CCGCATACGG ACATAAGTCA
CTGGGTGAGC CACCAATAAT TCCTGTTGCC GCTGCTATTC GTAACGCGGT GAAGATGGCT
ACCGGTGTTG CAATCAATAC ACTGCCGCTG ACGCCAAAAC GGTTATATGA AGAGTTCCAT
CTGGCAGGAT TGATTTGA
 
Protein sequence
MEAREATATG ESCMRVDAIA KVTGRARYTD DYVMAGMCYA KYVRSPIAHG YAVSINDEQA 
RSLPGVLAIF TWEDVPDIPF ATAGHAWTLD ENKRDTADRA LLTRHVRHHG DAVAIVVARD
ELTAEKAAQL VSIEWEELPV ITTPEAALAE DAAPIHNGGN LLKQSTMSTG NVQQTIDAAD
YQVQGHYQTP VIQHCHMESV TSLAWMEDDS RITIVSSTQI PHIVRRVVGQ ALDIPWSCVR
VIKPFVGGGF GNKQDVLEEP MAAFLTSKLG GIPVKVSLSR EECFLATRTR HAFTIDGQMG
VNRDGTLKGY SLDVLSNTGA YVSHGHSIAS AGGNKVAYLY PRCAYAYSSK TCYTNLPSAG
AMRGYGAPQV VFAVESMLDD AATALGIDPV EIRLRNASRE GDANPLTGKR IYSAGLPECL
EKGRKIFEWE KRRAECQNQQ GNLRRGVGVA CFSYTSNTWP VGVEIAGARL LMNQDGTINV
QSGATEIGQG ADTVFSQMVA ETVGVPVSDV RVISTQDTDV TPFDPGAFAS RQSYVAAPAL
RSAALLLKEK IIAHAAVMLH QSAMNLTLIK GHIVLVERPE EPLMSLKDLA MDAFYHPERG
GQLSAESSIK TTTNPPAFGC TFVDLTVDIA LCKVTINRIL NVHDSGHILN PLLAEGQVHG
GMGMGIGWAL FEEMIIDAKS GVVRNPNLLD YKMPTMPDLP QLESAFVEIN EPQSAYGHKS
LGEPPIIPVA AAIRNAVKMA TGVAINTLPL TPKRLYEEFH LAGLI
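
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The following is a minimal sketch, not the tool that produced this record. It assumes that the internal codon assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGGAAGCGC GGGAAGCAAC CGCTACGGGT GAATCATGCA TGCGCGTTGA TGCCATTGCT"
last_line = "CTGGCAGGAT TGATTTGA"

print(translate(first_line))  # MEAREATATGESCMRVDAIA
print(translate(last_line))   # LAGLI
```

The two outputs match the first twenty and last five residues of the protein sequence listed above. The last line can be translated on its own only because the preceding 2280 bases are a multiple of three, so it starts in frame.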