Gene Moth_1960 details


Gene Information

Locus tag: Moth_1960
Symbol:
ID: 3832311
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2038062
End bp: 2040353
Gene Length: 2292 bp
Protein Length: 763 aa
Translation table: 11
GC content: 55%
IMG OID: 637829891
Product: xanthine dehydrogenase subunit XdhA
Protein accession: YP_430801
Protein GI: 83590792
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID:
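
The coordinate and length fields in this record are mutually consistent, and the check is simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the CDS ends with a single untranslated stop codon.

```python
# Values copied from the gene record; variable names are illustrative.
start_bp = 2038062
end_bp = 2040353
gene_length_bp = 2292
protein_length_aa = 763

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
computed_protein_length = computed_gene_length // 3 - 1

print("gene length matches:", computed_gene_length == gene_length_bp)
print("protein length matches:", computed_protein_length == protein_length_aa)
```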


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.308037
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCTA CAGCAGTGGG GAAGAGTGTC ACCAGGGTTG ATGCTGTGGC CAAAGTTACC 
GGGCAAGCAA AGTATACCGG GGATTTTATG TTCAGGGACA TGCTGGTCGG TAAGGTCTTA
AGAAGTCCCT ATGCCCATGC CATCGTTAAA AACATTGATG TCAGCCAGGC CAGGGCTTTA
CCCGGCGTGG AAGGGGTCCT GACTTATAAG GATGTACCGC GGAATAAATT CCCCACTGCC
GGGCATCCCT ATTCCCTGGA TCCGGCCCAC GGCGATAAGG CCGACCGTAC CATCCTGACC
GATAAAGCCC GCTTCGTCGG TGACGCCATC GCGGCGGTAA TAGCCAGGGA TGAGCTGACG
GCCGCCGAGG CCCTGAAGCT TATTAAGGTA GAGTACGAGG TGTTAGAACC CCTCCTCACC
CCGGAGGCAG CCATGCGGGA GGGCGCGCCC CTGATTCATG AGGATTGCCC TGGCAACATC
CTCAGTGCCA GTGGCTATGC CATCGGCGAC GTCGACGAGG CTTTTAAAGA AGCCGACTAT
ATCTTTGAGG ATGAACTGGA AACTAGCATT GTTCAGCATT GCCAGCTGGA AAATCATATA
TCCTGCGCTT ATGTCGACAG CGACGGCCGC ATAGTGATTA TAACCTCCAC CCAGATCCCC
CATATTGTCA GGAGAATCGT CGGCCAGGCC TTGGGTATCC CTTGGGGCAG GATCCGGGTA
ATCAAGCCCT ACGTCGGGGG CGGCTTTGGT AGCAAACAGG ATGTCTGCAC CGAACCCCTG
GCCGCAGCCA TGACCCTGGC CGTGGGGGGA AGGCCGGTAA AACTGGAACT TTCCCGGGAA
GAATGCATGA TTGCCACCCG CACCCGCCAT GCCTTCAAGT TTAAGATCAA AACCGGGGTT
TCCCGGGACG GCAGGTTAAT CGGGATGCAT ATCAAGGCCA TTTCCAATAC CGGGGCCTAT
GCCTCCCACG GTCATTCCAT CGCCATGGCC GGTGGTTCCA AATTCAGGAT TCTTTATCCC
ATGAAGGCTT TGAAATATGA ACCATTTACA GTTTACACCA ACCTGCCGGT GGCGGGAGCC
ATGAGGGGTT ATGGATCGCC CCAGATAACC TTTGCCGTGG AAAGCCATCT GGATAACATC
GCCAGCAAAC TCAATATCGA TCCTATTGAG TTCCGTCTGA AAAACCTGGT TAAAGAGGGT
TATGTCGATC CTTTAAATGG CAACGCGGTG CGGAGTTGCG GCATTCGCGA GTGCATTGCC
CGGGGCAAAG AGTTGATCAA GTGGGACGAG AAAAAAGCGC GGTATAAAAA CCAAACCGGC
AGCAGACGCC GGGGCCTGGG CATGGCCTGC TTCAGCTACG GTTCCGGCAC TTACCCGGTG
GGGCTGGAAA TAGGAGGGGC GCGGATCGTC CTGAACCAGG ACGGTTCCGT CCAGCTACAG
GTTGGAGCTA CAGAAATAGG TCAGGGCAGC GATACCGTCT TTGCCCAGAT GGTAGCCGAG
GTCCTGGGTA TTCCGGTGGA TATGGTCCAT GTCCTTTCCA CCCAGGATAC AGATATTTCC
CCCTTTGATA CCGGTGCCTA TGCTTCCCGG CAGACCTATA TCACCGGTAT GGCGGTGGCC
AGGGCGGCCG CGGAGGTCAA GGAGAAGATC CTGGATTTTG CCTGGGGGAT GACCGATATC
CCAGCCCATG CCCTGGATAT TAAAGATGCC AATATCGTCT ATAAACACTC GGGCGAAGTG
GTCATGCCCC TGGCCGAGGT GGCCCTGCAC ACCTATTACG ACACCGTTTT CGCCAAACCG
ATAACGAGCG ATACCTCCAA TAATGCCCGG GTCAACGCCT TTGTTTTTGG CGTTACCTTT
GCCGAGGTGG AAGTAGACCT GAAAACCGGC CGGATCGAGG TCCTGGAGAT ATATAACGTC
CACGATTCCG GCAGGATTAT CAACCGCCAG CTGGCGGCAG CCCAGGTCCA CGGCGGCGTG
GGCATGGGCA TAGGTTATGC CCTCTCCGAG CAATTGCTCT TTGACGAGAA GACCGGCCAA
CCACTGAATA ACAATCTGCT GGATTACAAA CTGCCTACGA TTATGGATAT TCCCGCAATC
GGCGTAGATT TTGTGGAAAC CTTTGAACCT ACCAGCCCCT TTGGCTGCAA GGCTTTAGGC
GAGCCGCCGA TTATTCCTGT GGCCCCGGCC ATTCGTAATG CCGTTTTCGA CGCCACGGGC
GTGGCTTTTG ACAGGTTGCC CATGAATCCC CAACGGGTTT TCGAAAAGTT CAAGGAAGCC
GGGTTAATAT AG
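
The gene sequence is listed in blocks of ten bases, so the spaces and line breaks have to be removed before any calculation such as GC content. A minimal sketch follows, assuming GC content means the percentage of G and C bases over the sequence length. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, as listed in the record.
first_line = "ATGGCTTCTA CAGCAGTGGG GAAGAGTGTC ACCAGGGTTG ATGCTGTGGC CAAAGTTACC"
seq = clean_sequence(first_line)
print(len(seq), "bases,", round(gc_percent(seq), 1), "% GC in this fragment")
```

Passing the full listing through `clean_sequence` in the same way would give the value to compare against the GC content field.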
 
Protein sequence
MASTAVGKSV TRVDAVAKVT GQAKYTGDFM FRDMLVGKVL RSPYAHAIVK NIDVSQARAL 
PGVEGVLTYK DVPRNKFPTA GHPYSLDPAH GDKADRTILT DKARFVGDAI AAVIARDELT
AAEALKLIKV EYEVLEPLLT PEAAMREGAP LIHEDCPGNI LSASGYAIGD VDEAFKEADY
IFEDELETSI VQHCQLENHI SCAYVDSDGR IVIITSTQIP HIVRRIVGQA LGIPWGRIRV
IKPYVGGGFG SKQDVCTEPL AAAMTLAVGG RPVKLELSRE ECMIATRTRH AFKFKIKTGV
SRDGRLIGMH IKAISNTGAY ASHGHSIAMA GGSKFRILYP MKALKYEPFT VYTNLPVAGA
MRGYGSPQIT FAVESHLDNI ASKLNIDPIE FRLKNLVKEG YVDPLNGNAV RSCGIRECIA
RGKELIKWDE KKARYKNQTG SRRRGLGMAC FSYGSGTYPV GLEIGGARIV LNQDGSVQLQ
VGATEIGQGS DTVFAQMVAE VLGIPVDMVH VLSTQDTDIS PFDTGAYASR QTYITGMAVA
RAAAEVKEKI LDFAWGMTDI PAHALDIKDA NIVYKHSGEV VMPLAEVALH TYYDTVFAKP
ITSDTSNNAR VNAFVFGVTF AEVEVDLKTG RIEVLEIYNV HDSGRIINRQ LAAAQVHGGV
GMGIGYALSE QLLFDEKTGQ PLNNNLLDYK LPTIMDIPAI GVDFVETFEP TSPFGCKALG
EPPIIPVAPA IRNAVFDATG VAFDRLPMNP QRVFEKFKEA GLI
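
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own method. It assumes the codon assignments of translation table 11, which match the standard genetic code (table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG). The function name `translate` is hypothetical, and only the first and last lines of the gene listing are used.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, as listed in the record.
gene_start = "ATGGCTTCTA CAGCAGTGGG GAAGAGTGTC ACCAGGGTTG ATGCTGTGGC CAAAGTTACC"
gene_end = "GGGTTAATAT AG"

print(translate(gene_start))  # compare with the start of the protein sequence
print(translate(gene_end))    # compare with the end of the protein sequence
```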