Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0480 |
Symbol | |
ID | 3832418 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 484786 |
End bp | 488370 |
Gene Length | 3585 bp |
Protein Length | 1194 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637828414 |
Product | hypothetical protein |
Protein accession | YP_429353 |
Protein GI | 83589344 |
COG category | [S] Function unknown |
COG ID | [COG4717] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGCGG TAACCTGGCA GCGTCTGAGG CTGGCCGGTT TCGGCTGCTA CCGGGAGGGG GTGACCGTTG TTTTCCAGGA AGGCCTCAAC GTCCTCGTGG CCCCCAATGA AAAGGGCAAG TCAACCCTGG TTGCCGGCCT GGAAGCCGTC CTTTTCGGCC TCCCCAATAG CGGCAACCCG GAGGCCTTTG GCAGCGCCCG TTTTGCCAAC CGGGAAGAGC CGGACCGGTT TGAAGGCGAG TTGGAATTCC TGGTGGACGG CCGGGCCTAT GCAGTGAGGC GTCAATTTAA TAATAACCGG GTGACCATTA GGACCAGGGG GGCTCATGGC TGGGAAGAGG TCTGGCGGGG GAGCCACAAC CCGGCAGCCC ACAAGCACAA CCCGGTCTAT AACGAGCACC TGGCTTCCCT ACTGGGAATG ACCTCCCGGG AACTCTTTGA AGCCACCTTC TGCCTGGGCC AGCCCCTGCC GGAAGGAAGG GCTTTAAGCA GTGAGGTGCA AAAGCTCCTT TCCGGCAGCG GCGGCCACTA CCAGGAAGCC CTGAAAATGC TGGCTGACGA CCTGAGGGCC CTGACCCGTT ATTGCGGTGA CCGGGGCGTG GGCAACAACG GGCGCAAGGA TGCCCGGCTG GAGGAATTGC AGGCGGAGAT TGACTCTTTA CGGGCGCAGC AGCAGGCTAC CGGCAGTACC ATTGAAGAAC TGGCTGCCGT GCGTTCCCGG CTCCAGGAAC TGACGGCAGA GATTAAAGCA GTCCGGGTGC AACTAAAAAA ACAACAGGAC CTTCTGGCAG CCTGGGACGG ATGGCGCGCC CTGCAATTGA GGTACAAAAG CGCCCTGCGC GAACAGCAGC AGGCCGGGGC GGTCCAGGAA AGAGCCCGGG AACTGGAGGA GAAGATCTGC CGGCACAAGG AGCTGGCGGC CCGGTCGTAT CCCGAGTGGG CCGGCGCCCC GGCAGGTAGC GCCGGCAAAC TCGACCGCCT AGCGGAGCTG GAGCAGGAAA TAGCCCGCTT AAAGGGCGAG GCGGCGAACT GGGAGGCGCA GCTCCGGGAT TTAAACGCGG AGCAGGAGGC ATTAACAGCC AGGCTGCAGG GAGAACTGGC GGCGGTGGCT GATCACCCGG ACATCCTGCG GGACCTGGAG GACCTGCAGA GCCTCCTCAA GGAGCGGGAG GAACTGGAAG GGCAGGTGCG TGAACTGGCG GCCCGGGCGG CAGAAACGGC TGCTGAGCTG GCCGCCCTGC CTGATTTCAG CTTGCTGGGA AACAGCCCGG CCACGGTCCT GGAGGGCTTG CAGGTGGCCG CGCGTCAGCT CCTGGAGGAA TGGGGCCGCT TCCAGGAGCA GCAAAACCGC TATAAGGAAC TGGAAGCGGA ACTGGCGGGG GATCTGAGCT GTTTTGCCGG GCTGCTGCCG GAAAAGGAAA CCCTGCTGGC CAACTACGCC GCCACCAGGC TCGCCCTGGA AAGGGAAGAG CAGGCGGCCC GGGACGCCTG CCGGCGCCTG GAGGAACAAA AGGCCGCCTA CCGGCGCCGC GAGGAAGAGT TCCGGCGAGA ATTCGGCGAC CTGGAAGACC TGGGGGGAGA AGCCATCCAG GCAGCGACAG ATAAACCCCG GATCCTGGAG GAATTACAGG CCAGGGAAGC AGCCAGGAAG GCGGCCCTGG CAGCCTGCAG GCGCAGGAGG CTGGCCACAG CCCTGGGCCT GGCGGGCGCC GGCCTGGTGG GGGGGCTGGT AACCGGCCTG GCCACCGGCA ATTGGCTTCT CGCCCTAGCA ACTGCCCTGG TCCTGGCCGG CCTGGGCTAT GCTGCCGGAT CTTACCTGGG CCGACCCGGT AGGGACATCC TGGACCTGAC CGCGGCGATT ACCACCCTGC AGGGGCGCCT GGCGCAGCTG GATGCTATCT TAGGCCCCTG GGCGGGGGCC ACGGCGGCTG AACTGGGGGA GCTCCGCCAG CGCCTGCGGG AGCGGGATAG GGTCCGGGAG GAACTGGCAA CCCTGGCCGC TTCTTTGCCC GGCACGGCTG AAGAAGAGGC GGCCCGGCAG GCTCTGGAGG CAGCAACCAG GGTTAGAAGC GATTTCCATA CCGCTACCGC CGCCATCAGC GCCCGCTTTA GCGATGTGAC GGCAGCCTAC CGCCGTTACC TGACTACCAG GGAAGAAAAG CGGCGCCTGG CAGCTGAGAT GGAATCATTT ACCATAAAGG CTGTGGGAAC ACCGCCGCCG GCGGCTTTAA CAGTACCCCT TGCCGGCCTG GGAGCCCCAT GGCCGGGTCT TGCCTCCCTG GCGCGGCTGG CGGGGCAGGC TCCGGCCACG GTGGGCGAGC TTTTAACCTG GTTAAAGGAA CTGGATCATG CCGCCTGGGA AGATTTCCTG GCCCGGGCGC GGCGGTGGGA AGAGTTGACG TATCTTAGCC GGCAGGTGCA AGAAAAGAGG GAGCAGCTCC TTAAACCGGA TAAAGAGGGG CGGACGGCCC TGGAACGCCT GGCGAAGAAG ATTGCCGCCC TGGAAAGACA CGTGGCCCCC TTTACCGCGG CGATCGATCC CGGAGCCGTT GCCCCCCTCC TGGAGGAGGC GGGCAGGATA AAGGAACGCC TGGGCGAGCT GGCCGGCCGG CGGCAGGCCG CCGGGAGCCG GGTAAAGGAT TTACGGGAGC AAATCCAGGA GCTGGAACCT GAGGCGGCCG GCTTGCGGGA AGATCTGGCC GCCATCCTGG CACCGGGCGG GGGAGGGGCC GGTAAAGCCC TGGAGCGCCG GCAGGCCTAT GAACAATTGG CCCAGGAGTG GCAGGGCTGG CAGGAACAGC TGGCCGGCCT CCTGGGTGAA GGCGACCTGG AAGACCTGGC CACAGCTTAT CTGGATGCTG CCAACCGGAC CGTGGCCGTT CTGCAGGAAT GGCAGGACCT GGTCCGGGAG CATCCGGGCC TGCCCGAACC CGGAGGGGAA ACAAAAGGGG AGGAATTGGA AGAACAGTAC AGGGCCCTCC GGGCAGGGAT AATAGAAACG GAAGGCCGGT TACGGGGGCT GGAGGAAGAA GAACGTCAGC TCCTGCGCCG CCAGTCCCAA TTGGAAGGCA GCCAAATCGC CAACATGGCC ATTGTTGCCG AGACCCTGGC GGCCAGGGAA AAGGAGCTGG AGCGCCTGGA GCTGGAGGCC GGGGCCCTGG CCCTGGCCTA CCGGGAACTG GCGGCCGCGG CCCGGGACTA CAGCCAGAAC TACCGGCGGG AGCTGGCCCG GACCGCCAGC CGCTATTTCA ACCTCTTTAC CGGCCACCGG GAACGACAAG TTGATATAAC TGAAGATTTC CAGGTAGAGG TACGGGAAGC CGGGGTGGTT ATGGCCCTGG CCCAGCTCAG CCGCGGCGCC CAGGATCAGC TCTACCTCTC CTTACGCCTG GCTATCGGTG ACCTCCTGAG CGCCAACCTG ACCCTGCCTT TTATCTTCGA CGACTGCTTT GTCAACTGCG ATGCCGCCCG CCGGGAGCGC ATCCGGGAAA GCCTTGCGGC CCTGGCCCTC GAGCGGCAGT TGATTCTTTT ATCCCATGAC CCGGATTTTG CCGCCTGGGG GATGCCGGTG AAGATCGAAA AGTAA
|
Protein sequence | MPAVTWQRLR LAGFGCYREG VTVVFQEGLN VLVAPNEKGK STLVAGLEAV LFGLPNSGNP EAFGSARFAN REEPDRFEGE LEFLVDGRAY AVRRQFNNNR VTIRTRGAHG WEEVWRGSHN PAAHKHNPVY NEHLASLLGM TSRELFEATF CLGQPLPEGR ALSSEVQKLL SGSGGHYQEA LKMLADDLRA LTRYCGDRGV GNNGRKDARL EELQAEIDSL RAQQQATGST IEELAAVRSR LQELTAEIKA VRVQLKKQQD LLAAWDGWRA LQLRYKSALR EQQQAGAVQE RARELEEKIC RHKELAARSY PEWAGAPAGS AGKLDRLAEL EQEIARLKGE AANWEAQLRD LNAEQEALTA RLQGELAAVA DHPDILRDLE DLQSLLKERE ELEGQVRELA ARAAETAAEL AALPDFSLLG NSPATVLEGL QVAARQLLEE WGRFQEQQNR YKELEAELAG DLSCFAGLLP EKETLLANYA ATRLALEREE QAARDACRRL EEQKAAYRRR EEEFRREFGD LEDLGGEAIQ AATDKPRILE ELQAREAARK AALAACRRRR LATALGLAGA GLVGGLVTGL ATGNWLLALA TALVLAGLGY AAGSYLGRPG RDILDLTAAI TTLQGRLAQL DAILGPWAGA TAAELGELRQ RLRERDRVRE ELATLAASLP GTAEEEAARQ ALEAATRVRS DFHTATAAIS ARFSDVTAAY RRYLTTREEK RRLAAEMESF TIKAVGTPPP AALTVPLAGL GAPWPGLASL ARLAGQAPAT VGELLTWLKE LDHAAWEDFL ARARRWEELT YLSRQVQEKR EQLLKPDKEG RTALERLAKK IAALERHVAP FTAAIDPGAV APLLEEAGRI KERLGELAGR RQAAGSRVKD LREQIQELEP EAAGLREDLA AILAPGGGGA GKALERRQAY EQLAQEWQGW QEQLAGLLGE GDLEDLATAY LDAANRTVAV LQEWQDLVRE HPGLPEPGGE TKGEELEEQY RALRAGIIET EGRLRGLEEE ERQLLRRQSQ LEGSQIANMA IVAETLAARE KELERLELEA GALALAYREL AAAARDYSQN YRRELARTAS RYFNLFTGHR ERQVDITEDF QVEVREAGVV MALAQLSRGA QDQLYLSLRL AIGDLLSANL TLPFIFDDCF VNCDAARRER IRESLAALAL ERQLILLSHD PDFAAWGMPV KIEK
|
| |