Gene EcSMS35_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2999 
SymbolxdhA 
ID6147376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3079286 
End bp3081583 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content53% 
IMG OID641617868 
Productxanthine dehydrogenase subunit XdhA 
Protein accessionYP_001745019 
Protein GI170681574 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.821055 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGCGC GGGAAGCAAC CGCTACGGGT GAATCTTGCA TGCGCGTCGA TGCCATTGCT 
AAGGTCACCG GGCGGGCACG ATATACTGAC GATTATGTTA TGGCGGGCAT GTGTTACGCG
AAATATGTAC GTAGCCCTAT CGCACATGGT TATGCCGTAA GTATTAATGA TGAACAAGCC
AGAAGTTTAC CAGGCGTACT GGCGATTTTT ACCTGGGAAG ATGTACCTGA TATCCCATTC
GCTACAGCTG GGCATGCCTG GACACTTGAC GAAAACAAGC GCGATACCGC CGATCGCGCA
CTGCTAACTC GCCATGTTCG TCATCATGGT GACGCCGTTG CCATCGTCGT GGCCCGTGAT
GAACTCACGG CAGAAAAAGC GGCGCAATTG GTCAGCATTG AGTGGCAAGA ATTACCCGTT
ATCACCACGC CAGAAGCGGC GCTGGCAGAT GACGCTGCAC CAATCCATAA CGGCGGCAAT
TTACTAAAAC AAAGCTCGAT GTCGACGGGT AATGTCCAAC AAACAATCGA TGCCGTCGAT
TACCAGGTGC AGGGGCACTA TCAGACGCCG GTCATTCAAC ATTGTCACAT GGAAAGCGTG
ACATCGCTGG CATGGATGGA GGATGACTCG CGAATTACCA TCGTTTCCAG CACCCAGATC
CCGCACATTG TTCGCCGTGT GGTTGGTCAG GCGCTGAATA TTCCCTGGTC ATGCGTACGA
GTCATCAAAC CGTTTGTCGG TGGCGGTTTT GGTAATAAAC AGGATGTACT GGAAGAGCCA
ATGGCGGCAT TCCTGACCAG CAAGCTTGGC GGCATTCCGG TGAAAGTTTC CCTTAGCCGT
GAAGAGTGTT TCCTCGCAAC CCGTACCCGC CACGCTTTTA CTATTGACGG GCAAATGGGC
GTGAACCGCG ACGGAACATT GAAAGGTTAT AGTCTGGATG TTCTGTCTAA CACCGGCGCT
TATGCATCTC ACGGGCACTC CATCGCTTCT GCAGGGGGGA ATAAAGTCGC TTACCTTTAT
CCTCGTTGTG CCTACGCTTA CAGTTCAAAG ACCTGCTATA CCAACCTCCC CTCGGCTGGT
GCGATGCGTG GTTATGGCGC GCCACAAGTC GTATTTGCCG TTGAGTCTAT GCTTGATGAC
GCCGCGACAG CGTTAGGTAT TGATCCTGTT GAAATTCGTT TACGCAACGC CGCCCGCGAA
GGAGATGCTA ATCCGCTCAC AGGCAAACGT ATTTACAGCG CAGGGTTGCC GGAGTGTCTT
GAAAAAGGCC GGAAAATCTT TGAATGGGAA AAACGCCGTG CAGAGTGCCA GAACCAGCAA
GGCAATTTAC GCCGCGGCGT TGGCGTCGCC TGTTTTAGCT ACACCTCTAA CACCTGGCCT
GTCGGCGTAG AAATTGCTGG CGCGCGCCTG TTGATGAATC AGGATGGAAC CATCAACGTA
CAGAGTGGTG CCACGGAAAT CGGCCAGGGT GCCGACACCG TGTTCTCGCA AATGGTGGCA
GAAACCGTGG GGGTTCCGGT CAGCGACATT CGCGTTATTT CAACACAAGA TACCGACGTT
ACGCCGTTCG ATCCCGGCGC ATTTGCCTCA CGCCAGAGCT ATGTTGCCGC GCCCGCGCTA
CGCAGTGCGG CACTGTTATT AAAGGAGAAA ATCATCGCTC ACGCCGCTGT CATGCTACAT
CAGTCAGCGA TGAATCTTAC CTTGATAAAA GGCCATATCG TGCTGGTTGA ACGACCGGAA
GAGCCGTTAA TGTCGTTAAA AGATTTGGCG ATGGATGCCT TCTACCACCC TGAACGCGGC
GGGCAGCTCT CTGCCGAAAG CTCCATCAAA ACCACCACTA ACCCACCAGC GTTTGGCTGT
ACCTTTGTTG ATCTGACGAT CGATATTGCG CTGTGCAAAG TCACCATCAA CCGCATCCTT
AACGTTCATG ATTCGGGCCA TATTCTTAAT CCGTTGCTGG CAGAAGGTCA GGTACACGGC
GGAATGGGAA TGGGCATTGG CTGGGCGCTA TTTGAAGAGA TGATCATCGA TGCGAAAAGC
GGCGTGGTCC GTAACCCCAA TCTGCTGGAT TACAAAATGC CCACCATGCC GGATCTGCCA
CAACTGGAAA GCGCGTTCGT CGAAATCAAT GAGCCGCAAT CCGCATACGG ACATAAGTCA
CTGGGTGAGC CACCAATAAT TCCTGTTGCC GCTGCTATTC GTAACGCGGT GAAGATGGCT
ACCGGTGTTG CAATCAATAC ACTGCCGCTG ACGCCAAAAC GGTTATATGA AGAGTTCCAT
CTGGCAGGAT TGATTTGA
 
Protein sequence
MEAREATATG ESCMRVDAIA KVTGRARYTD DYVMAGMCYA KYVRSPIAHG YAVSINDEQA 
RSLPGVLAIF TWEDVPDIPF ATAGHAWTLD ENKRDTADRA LLTRHVRHHG DAVAIVVARD
ELTAEKAAQL VSIEWQELPV ITTPEAALAD DAAPIHNGGN LLKQSSMSTG NVQQTIDAVD
YQVQGHYQTP VIQHCHMESV TSLAWMEDDS RITIVSSTQI PHIVRRVVGQ ALNIPWSCVR
VIKPFVGGGF GNKQDVLEEP MAAFLTSKLG GIPVKVSLSR EECFLATRTR HAFTIDGQMG
VNRDGTLKGY SLDVLSNTGA YASHGHSIAS AGGNKVAYLY PRCAYAYSSK TCYTNLPSAG
AMRGYGAPQV VFAVESMLDD AATALGIDPV EIRLRNAARE GDANPLTGKR IYSAGLPECL
EKGRKIFEWE KRRAECQNQQ GNLRRGVGVA CFSYTSNTWP VGVEIAGARL LMNQDGTINV
QSGATEIGQG ADTVFSQMVA ETVGVPVSDI RVISTQDTDV TPFDPGAFAS RQSYVAAPAL
RSAALLLKEK IIAHAAVMLH QSAMNLTLIK GHIVLVERPE EPLMSLKDLA MDAFYHPERG
GQLSAESSIK TTTNPPAFGC TFVDLTIDIA LCKVTINRIL NVHDSGHILN PLLAEGQVHG
GMGMGIGWAL FEEMIIDAKS GVVRNPNLLD YKMPTMPDLP QLESAFVEIN EPQSAYGHKS
LGEPPIIPVA AAIRNAVKMA TGVAINTLPL TPKRLYEEFH LAGLI