Gene Moth_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_2003 
Symbol 
ID3831957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp2089404 
End bp2090771 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content63% 
IMG OID637829932 
ProductMobA-like protein-like 
Protein accessionYP_430842 
Protein GI83590833 
COG category[R] General function prediction only 
COG ID[COG2068] Uncharacterized MobA-related protein 
TIGRFAM ID[TIGR03172] probable selenium-dependent hydroxylase accessory protein YqeC
[TIGR03310] molybdenum hydroxylase accessory protein, YgfJ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000020239 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTGC GCCAGGCTTT AGATTTACAA GCCAAGGAGA TTATTACCAT GGTGGGGGCC 
GGGGGTAAGA CTTCGGCCCT TATTTGTTTG GCCCGGGAGC TGGCGGCCGC CGGGAAACGG
GTAATTGTAG CCCCTACCAC CAGGATGCTC CCCGGCCAGC TTTCCCGCCT GGCCGAACCT
ATTCTCAACA GTGACGGTAA CTGTTTAACG AAAACTGTCA AGTACCGCTT GCAGAGAGAA
AATCTGATTA CCTGCGGTTC CGGTATCGAT GACCGGGGCA AGGTAATAGG TCTTGACGCG
GCCACGGTAG CGGCCCTGGG CGATCTGGAC GTCGATTACC TGCTCCTGGA AGGTGATGGG
GCGGCGGGCG CCCTCCTCAA AGCCCCGGCG GCCCATGAGC CGGTCATCCC GCCGGTCACG
ACCATGGTGA TCGCGGTAGC CGGCCTGCCG GTCCTGGGCC AGCCCCTGGC CCCGCCCTTT
GTCCACCGGC CCCGACTAGT GGCCGGACTC CTGGGACGGG GAGAGGACTG TCCCCTGGCG
ACAGCCGATG TAGCCAGGGT TCTGGCCTAC CCGGATGGGG GCCGCAAAGG AGTACCGCCG
GGGGCCCGCT GGCTGGCCCT CCTCAATCAG GCCGAGGGTT ACGACCTTTT ACGTCGGGGA
CGAGAGGTAG CCGCCGCCAT CTTTAACGCC GGCGGGGAAA AGGTGATCCT GGGTGCGGTG
GCTACTGCAA ACCCGGTACG CCAGGTCCTT GGGGGCCCGG ACCCTCCGGC CGGCAAGGTG
GGGGTTATCG TCCTGGCCGC CGGCGCCGGG GAGCGGATGG GTGGTGGTAA GCTTTTATTG
CCCCTCAAGG GCCAGCCCCT GGTACGCCGG GTGGTGCTTA CCGCCCTGGA AGCCGCCGGG
GACAAAGTCG TTGTCGTCCT GGGCCATGAA AGCGATAAAA CGGCTGCCGC CCTGGCAGGT
TTAAAGGTAG ATTTGGCCAT TAACCCAGCC TATCGCCTGG GCCTCAGCAC CTCCCTGCAA
GCCGGCCTGG CGGCCCTGCC GCCCCGGACC CCGGCGGCCC TCTTTGTCCT GGCCGATCAG
CCGGGAGTTA CGCCGTCTGT CCTCAGGCAG CTTTTAGAAG CCTACCGGCC GGGGGGCAGG
AGGATAATTG TCCCGGTTTA CCGCGGCCGG CGGGGGAATC CTGTCCTTAT CGATCGCGGC
CTCTGGCCGC AAATCCTGAA CCTTAAGGGA GACATCGGCG CCCGGGAAAT TATCCGGGAA
TATCCAGAAG AGGTCTTGCC GGTGGAAGTC TCCTGCCCGG GGATTTTTCA GGATATCGAT
ACCCCGGCTG ATTACCGGGC CTGGTTGCAG GAAAATAGTC CTTGCTAA
 
Protein sequence
MQLRQALDLQ AKEIITMVGA GGKTSALICL ARELAAAGKR VIVAPTTRML PGQLSRLAEP 
ILNSDGNCLT KTVKYRLQRE NLITCGSGID DRGKVIGLDA ATVAALGDLD VDYLLLEGDG
AAGALLKAPA AHEPVIPPVT TMVIAVAGLP VLGQPLAPPF VHRPRLVAGL LGRGEDCPLA
TADVARVLAY PDGGRKGVPP GARWLALLNQ AEGYDLLRRG REVAAAIFNA GGEKVILGAV
ATANPVRQVL GGPDPPAGKV GVIVLAAGAG ERMGGGKLLL PLKGQPLVRR VVLTALEAAG
DKVVVVLGHE SDKTAAALAG LKVDLAINPA YRLGLSTSLQ AGLAALPPRT PAALFVLADQ
PGVTPSVLRQ LLEAYRPGGR RIIVPVYRGR RGNPVLIDRG LWPQILNLKG DIGAREIIRE
YPEEVLPVEV SCPGIFQDID TPADYRAWLQ ENSPC