Gene Information |
Locus tag | Moth_2003 |
Symbol | |
ID | 3831957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2089404 |
End bp | 2090771 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637829932 |
Product | MobA-like protein-like |
Protein accession | YP_430842 |
Protein GI | 83590833 |
COG category | [R] General function prediction only |
COG ID | [COG2068] Uncharacterized MobA-related protein |
TIGRFAM ID | [TIGR03172] probable selenium-dependent hydroxylase accessory protein YqeC; [TIGR03310] molybdenum hydroxylase accessory protein, YgfJ family |
|
|
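The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes a single stop codon. Both are common conventions, but the record itself does not state them. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2089404
END_BP = 2090771
REPORTED_GENE_LENGTH = 1368    # bp
REPORTED_PROTEIN_LENGTH = 455  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Compare the derived values with the values reported in the record.
print(gene_length(START_BP, END_BP) == REPORTED_GENE_LENGTH)
print(protein_length(REPORTED_GENE_LENGTH) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the reported fields agree with each other: 2090771 - 2089404 + 1 = 1368 bp, and 1368 / 3 - 1 = 455 aa.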
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000020239 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTGC GCCAGGCTTT AGATTTACAA GCCAAGGAGA TTATTACCAT GGTGGGGGCC GGGGGTAAGA CTTCGGCCCT TATTTGTTTG GCCCGGGAGC TGGCGGCCGC CGGGAAACGG GTAATTGTAG CCCCTACCAC CAGGATGCTC CCCGGCCAGC TTTCCCGCCT GGCCGAACCT ATTCTCAACA GTGACGGTAA CTGTTTAACG AAAACTGTCA AGTACCGCTT GCAGAGAGAA AATCTGATTA CCTGCGGTTC CGGTATCGAT GACCGGGGCA AGGTAATAGG TCTTGACGCG GCCACGGTAG CGGCCCTGGG CGATCTGGAC GTCGATTACC TGCTCCTGGA AGGTGATGGG GCGGCGGGCG CCCTCCTCAA AGCCCCGGCG GCCCATGAGC CGGTCATCCC GCCGGTCACG ACCATGGTGA TCGCGGTAGC CGGCCTGCCG GTCCTGGGCC AGCCCCTGGC CCCGCCCTTT GTCCACCGGC CCCGACTAGT GGCCGGACTC CTGGGACGGG GAGAGGACTG TCCCCTGGCG ACAGCCGATG TAGCCAGGGT TCTGGCCTAC CCGGATGGGG GCCGCAAAGG AGTACCGCCG GGGGCCCGCT GGCTGGCCCT CCTCAATCAG GCCGAGGGTT ACGACCTTTT ACGTCGGGGA CGAGAGGTAG CCGCCGCCAT CTTTAACGCC GGCGGGGAAA AGGTGATCCT GGGTGCGGTG GCTACTGCAA ACCCGGTACG CCAGGTCCTT GGGGGCCCGG ACCCTCCGGC CGGCAAGGTG GGGGTTATCG TCCTGGCCGC CGGCGCCGGG GAGCGGATGG GTGGTGGTAA GCTTTTATTG CCCCTCAAGG GCCAGCCCCT GGTACGCCGG GTGGTGCTTA CCGCCCTGGA AGCCGCCGGG GACAAAGTCG TTGTCGTCCT GGGCCATGAA AGCGATAAAA CGGCTGCCGC CCTGGCAGGT TTAAAGGTAG ATTTGGCCAT TAACCCAGCC TATCGCCTGG GCCTCAGCAC CTCCCTGCAA GCCGGCCTGG CGGCCCTGCC GCCCCGGACC CCGGCGGCCC TCTTTGTCCT GGCCGATCAG CCGGGAGTTA CGCCGTCTGT CCTCAGGCAG CTTTTAGAAG CCTACCGGCC GGGGGGCAGG AGGATAATTG TCCCGGTTTA CCGCGGCCGG CGGGGGAATC CTGTCCTTAT CGATCGCGGC CTCTGGCCGC AAATCCTGAA CCTTAAGGGA GACATCGGCG CCCGGGAAAT TATCCGGGAA TATCCAGAAG AGGTCTTGCC GGTGGAAGTC TCCTGCCCGG GGATTTTTCA GGATATCGAT ACCCCGGCTG ATTACCGGGC CTGGTTGCAG GAAAATAGTC CTTGCTAA
|
Protein sequence | MQLRQALDLQ AKEIITMVGA GGKTSALICL ARELAAAGKR VIVAPTTRML PGQLSRLAEP ILNSDGNCLT KTVKYRLQRE NLITCGSGID DRGKVIGLDA ATVAALGDLD VDYLLLEGDG AAGALLKAPA AHEPVIPPVT TMVIAVAGLP VLGQPLAPPF VHRPRLVAGL LGRGEDCPLA TADVARVLAY PDGGRKGVPP GARWLALLNQ AEGYDLLRRG REVAAAIFNA GGEKVILGAV ATANPVRQVL GGPDPPAGKV GVIVLAAGAG ERMGGGKLLL PLKGQPLVRR VVLTALEAAG DKVVVVLGHE SDKTAAALAG LKVDLAINPA YRLGLSTSLQ AGLAALPPRT PAALFVLADQ PGVTPSVLRQ LLEAYRPGGR RIIVPVYRGR RGNPVLIDRG LWPQILNLKG DIGAREIIRE YPEEVLPVEV SCPGIFQDID TPADYRAWLQ ENSPC
|
| |
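The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is shown in coding orientation, which fits the fact that it begins with ATG and ends with TAA although the gene lies on the minus strand. It builds the codon table from the standard compact amino-acid string, which translation table 11 shares for non-start codons. The helper names are hypothetical, and only the first 30 bases of the record are used as input. Table 11 also allows alternative start codons that are read as M; the sketch does not handle those because this gene starts with ATG.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the record's sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence, copied from the record.
GENE_PREFIX = "ATGCAACTGC GCCAGGCTTT AGATTTACAA"
print(translate(GENE_PREFIX))  # MQLRQALDLQ, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_percent` is one way to compare the result against the Protein sequence and GC content fields.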