Gene Moth_2135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2135 
Symbol 
ID3833135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2232314 
End bp2233285 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content62% 
IMG OID637830060 
ProductGTP cyclohydrolase subunit MoaA 
Protein accessionYP_430970 
Protein GI83590961 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG2896] Molybdenum cofactor biosynthesis enzyme 
TIGRFAM ID[TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCAGGACA CCTTTCAACG CCAGATAAAC TACCTGCGCA TCGCCATTAC CGATCGCTGT 
AACCTGCGCT GCCGTTATTG TATGCCGGCC ACGGGGGTGC CCTTGAAGGG TCACGAGGAT
ATCCTGCGCC TGGAAGAGAT CGCCACCCTG GCCCGGGTAG CTGCCGGTAC TGGTATCAGC
CGGATTCGCC TCACCGGGGG CGAGCCCCTG GTCCGGAAAA ACGTGGTGAC CCTGGTGCGG
GAACTGGCGG CCATTCCCGG CCTGGAGGAG ATCTCCCTGA CAACCAACGG CATCTTCCTG
GGGGCCCTGG CCTTTTCTTT AAAAGAGGCC GGACTGAAGC GGGTGAATAT CAGCCTGGAC
ACCCTGAAGA AGGACCGCTA CCGCTATATC ACCCGCCGCG GCAACATCAC CAGCGTCTGG
CAGGGCATCC GGGCGGCCCT GGCCGCTGGC CTGACGCCGG TTAAACTCAA TGTCGTCATT
ACGCGGGGCT TTAACGACGA TGAGATCCTG GATTTTGCCC GGCTGGCCAG GGAAGAACCC
CTGCATATCC GTTTTATCGA GCTCATGCCC ATTGGTACGG CGGCCGCCTC CGGTACCGCT
TATGTGCCGG CGGAGGAGAT TAAGGGCCGG ATCAGCCGGG TTTACCCCCT GGAACCCTTC
CCGGACCTGG CAACCAACGG GCCGGCAGCC AATTTCAGGC TGGTCGGCGG CCGGGGAAGT
GTGGGATTTA TCACCCCCAT GTCCAATCAC TTCTGTTCCC GCTGTAACCG CCTGCGCCTG
ACGGCAGACG GCAAGCTCAG GCCCTGCCTC TACTGGGACG GGGAGATAGA TATCAAAGGG
CCTTTGCGTG CCGGGGCTCC GGAGACCGAA CTGGCGGCTA TTTTTGCCCG GGCCGTCAGC
TTGAAGCCCG CCGAACACCA CATGGAGAAC GGCTGGCGCC AGCCCCGGGC CATGTCCCAG
ATAGGCGGCT GA
 
Protein sequence
MQDTFQRQIN YLRIAITDRC NLRCRYCMPA TGVPLKGHED ILRLEEIATL ARVAAGTGIS 
RIRLTGGEPL VRKNVVTLVR ELAAIPGLEE ISLTTNGIFL GALAFSLKEA GLKRVNISLD
TLKKDRYRYI TRRGNITSVW QGIRAALAAG LTPVKLNVVI TRGFNDDEIL DFARLAREEP
LHIRFIELMP IGTAAASGTA YVPAEEIKGR ISRVYPLEPF PDLATNGPAA NFRLVGGRGS
VGFITPMSNH FCSRCNRLRL TADGKLRPCL YWDGEIDIKG PLRAGAPETE LAAIFARAVS
LKPAEHHMEN GWRQPRAMSQ IGG