Gene EcHS_A3026 details


Gene Information

Locus tag: EcHS_A3026
Symbol: xdhA
ID: 5594661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3027820
End bp: 3030117
Gene Length: 2298 bp
Protein Length: 765 aa
Translation table: 11
GC content: 53%
IMG OID: 640922143
Product: xanthine dehydrogenase subunit XdhA
Protein accession: YP_001459645
Protein GI: 157162327
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
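
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions that happen to reproduce the values in this record, not documented conventions of the source database. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3027820
end_bp = 3030117

# Assumption: coordinates are 1-based and inclusive, so the span
# length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon, so the
# protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed gene length and protein length agree with the 2298 bp and 765 aa listed above, and the remainder of zero confirms the span is a whole number of codons.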


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.214024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGCGC GGGAAGCAAC CGCTACGGGT GAATCATGCA TGCGCGTCGA TGCCATTGCT 
AAGGTCACCG GGCGGGCACG ATATACTGAC GATTATGTTA TGGCGGGCAT GTGTTATGCG
AAATATGTAC GTAGCCCTAT CGCACATGGT TATGCCGTAA GTATTAATGA TGAACAAGCC
AGAAGTTTGC CGGGCGTACT GGCGATTTTT ACCTGGGAAG ATGTGCCTGA TATTCCATTC
GCTACAGCTG GGCATGCCTG GACACTTGAC GAAAACAAGC GCGATACCGC CGATCGCGCA
CTGCTAACTC GCCATGTTCG TCATCATGGT GACGCCGTTG CCATCGTCGT GGCCCGCGAT
GAACTCACGG CAGAAAAAGC GGCGCAATTG GTCAGCATTG AGTGGCAAGA ATTACCCGTT
ATCACCACGC CAGAAGCGGC GCTGGCAGAA GACGCTGCAC CAATCCATAA CGGTGGCAAT
TTACTGAAAC AAAGCACGAT GTCGACGGGT AATGTCCAAC AAACAATCGA TGCCGCCGAC
TACCAGGTAC AGGGGCACTA TCAGACCCCC GTTATTCAAC ATTGTCACAT GGAAAGCGTA
ACATCGCTGG CGTGGATGGA GGATGACTCG CGAATTACCA TCGTTTCCAG CACCCAGATC
CCGCACATTG TTCGCCGCGT GGTTGGTCAG GCGCTGGATA TTCCCTGGTC ATGCGTACGA
GTCATCAAAC CATTTGTCGG TGGCGGTTTT GGTAATAAAC AGGATGTACT GGAAGAGCCA
ATGGCGGCAT TCCTGACCAG CAAGCTTGGC GGCATTCCGG TGAAAGTTTC CCTTAGCCGT
GAAGAGTGTT TCCTCGCAAC CCGTACCCGC CACGCTTTTA CCATTGACGG GCAAATGGGC
GTGAACCGCG ACGGAACATT GAAAGGTTAT AGTCTGGATG TTCTGTCTAA CATCGGCGCT
TATGCATCTC ACGGGCACTC CATCGCTTCT GCGGGGGGGA ATAAAGTCGC TTACCTTTAT
CCTCGTTGTG CCTACGCTTA CAGTTCAAAG ACCTGCTATA CCAACCTCCC CTCGGCTGGT
GCGATGCGTG GTTATGGCGC GCCACAAGTC GTATTTGCCG TTGAGTCTAT GCTTGATGAC
GCCGCGACAG CGTTAGGTAT TGATCCTGTT GAAATTCGTT TACGCAACGC CGCCCGCGAA
GGAGATGCTA ATCCGCTCAC GGGCAAACGT ATTTACAGCG CAGGGTTGCC GGAGTGTCTT
GAAAAAGGCC GGAAAATCTT TGAATGGGAA AAACGCCGTG CAGAATGCCA GAACCAGCAA
GGCAATTTGC GCCGCGGCGT TGGCGTCGCC TGTTTTAGCT ACACCTCTAA CACCTGGCCT
GTCGGCGTAG AAATAGCAGG CGCGCGCCTT CTGATGAATC AGGATGGAAC CATCAACGTG
CAAAGCGGCG CGACGGAAAT CGGTCAGGGT GCCGACACCG TCTTCTCGCA AATGGTGGCA
GAAACCGTGG GGGTTCCGGT CAGCGACGTT CGCGTTATTT CAACTCAAGA TACCGACGTT
ACGCCGTTCG ATCCCGGCGC ATTTGCCTCA CGCCAGAGCT ATGTTGCCGC GCCTGCGCTG
CGCAGTGCGG CACTATTATT AAAAGAGAAA ATCATCGCTC ACGCCGCAGT CATGCTACAT
CAGTCAGCGA TGAATCTGAC CCTGATAAAA GGCCATATCG TGCTGGTTGA ACGACCGGAA
GAGCCGTTAA TGTCGTTAAA AGATTTGGCG ATGGACGCTT TCTACCACCC TGAACGCGGC
GGGCAGCTCT CTGCTGAAAG CTCCATCAAA ACCACCACTA ACCCACCGGC GTTTGGCTGT
ACCTTTGTTG ATCTGACGGT CGATATTGCG CTGTGCAAAG TCACCATCAA CCGCATCCTC
AACGTTCATG ATTCAGGGCA TATTCTTAAT CCACTGCTGG CAGAAGGTCA GGTACACGGC
GGAATGGGAA TGGGCATTGG CTGGGCGCTA TTTGAAGAGA TGATCATCGA TGCTAAAAGC
GGCGTGGTCC GTAACCCCAA TCTGCTGGAT TACAAAATGC CGACCATGCC GGATCTGCCA
CAACTGGAAA GCGCGTTCGT CGAAATCAAT GAGCCGCAAT CCGCATACGG ACATAAGTCA
CTGGGTGAGC CACCAATAAT TCCTGTTGCC GCTGCTATTC GTAACGCGGT GAAGATGGCT
ACCGGTGTTG CAATCAATAC ACTGCCGCTG ACGCCAAAAC GGTTATATGA AGAGTTCCAT
CTGGCAGGAT TGATTTGA
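
The gene sequence is printed in space-separated blocks of ten bases, so it has to be stripped of whitespace before its length or GC content can be computed. The sketch below shows one way to do that; the helper names `clean` and `gc_percent` are hypothetical, and the record does not state how its own GC content figure was calculated or rounded.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence in this record, as printed.
first_line = (
    "ATGGAAGCGC GGGAAGCAAC CGCTACGGGT GAATCATGCA TGCGCGTCGA TGCCATTGCT"
)

print(len(clean(first_line)))   # number of bases on that line
print(gc_percent("ATGC GGCC"))  # toy input: 6 of 8 bases are G or C
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.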
 
Protein sequence
MEAREATATG ESCMRVDAIA KVTGRARYTD DYVMAGMCYA KYVRSPIAHG YAVSINDEQA 
RSLPGVLAIF TWEDVPDIPF ATAGHAWTLD ENKRDTADRA LLTRHVRHHG DAVAIVVARD
ELTAEKAAQL VSIEWQELPV ITTPEAALAE DAAPIHNGGN LLKQSTMSTG NVQQTIDAAD
YQVQGHYQTP VIQHCHMESV TSLAWMEDDS RITIVSSTQI PHIVRRVVGQ ALDIPWSCVR
VIKPFVGGGF GNKQDVLEEP MAAFLTSKLG GIPVKVSLSR EECFLATRTR HAFTIDGQMG
VNRDGTLKGY SLDVLSNIGA YASHGHSIAS AGGNKVAYLY PRCAYAYSSK TCYTNLPSAG
AMRGYGAPQV VFAVESMLDD AATALGIDPV EIRLRNAARE GDANPLTGKR IYSAGLPECL
EKGRKIFEWE KRRAECQNQQ GNLRRGVGVA CFSYTSNTWP VGVEIAGARL LMNQDGTINV
QSGATEIGQG ADTVFSQMVA ETVGVPVSDV RVISTQDTDV TPFDPGAFAS RQSYVAAPAL
RSAALLLKEK IIAHAAVMLH QSAMNLTLIK GHIVLVERPE EPLMSLKDLA MDAFYHPERG
GQLSAESSIK TTTNPPAFGC TFVDLTVDIA LCKVTINRIL NVHDSGHILN PLLAEGQVHG
GMGMGIGWAL FEEMIIDAKS GVVRNPNLLD YKMPTMPDLP QLESAFVEIN EPQSAYGHKS
LGEPPIIPVA AAIRNAVKMA TGVAINTLPL TPKRLYEEFH LAGLI
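
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python; the names `CODON_TABLE` and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, which does not matter here because this gene begins with ATG. Input containing characters other than A, C, G and T raises a `KeyError` in this sketch.

```python
# Codon table in the conventional TCAG order; amino acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last line of the gene sequence in this record.
first_ten = translate("ATGGAAGCGC GGGAAGCAAC CGCTACGGGT")
last_five = translate("CTGGCAGGAT TGATTTGA")
print(first_ten, last_five)
```

The two fragments translate to the first ten and last five residues of the protein sequence listed above, with the final TGA codon acting as the stop.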