Gene Information |
Locus tag | Moth_1665 |
Symbol | |
ID | 3831936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1698764 |
End bp | 1701022 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637829590 |
Product | hypothetical protein |
Protein accession | YP_430510 |
Protein GI | 83590501 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1331] Highly conserved protein containing a thioredoxin domain |
TIGRFAM ID | |
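
The length fields in this record follow from the coordinates: with 1-based, inclusive positions, the gene length is End bp minus Start bp plus one, and the protein length is the codon count minus one. A minimal sketch of that arithmetic follows; the variable names are illustrative, and it assumes the final codon is a stop codon that is not counted in the protein length.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 1698764
end_bp = 1701022

# Inclusive span: both the start and end positions belong to the gene.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
codons = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Both values agree with the Gene Length and Protein Length fields listed in the record.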
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.590351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000463934 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCCGATC TGCGCCGTCC CAACCGTTTA ATCCATGAAA AAAGCCCCTA TCTGCTGCAG CACGCCTACA ACCCCGTCGA TTGGTATCCC TGGGGGGAAG AGGCCTTTGC CCGGGCCAAG CGGGAAGATA AGCCGGTATT TTTATCCATC GGCTATTCCA CCTGTCACTG GTGCCACGTC ATGGCCAGGG AATCCTTTAA CGATGAAGAG GTAGCCGCCC TCCTCAACGA TAGCTTTATC GCTATCAAGG TCGACCGGGA AGAACGGCCG GACATCGACC AGGTCTATAT GGCCGCCTGC CAGGCCCTGA CCGGCAGCGG CGGCTGGCCC CTGACTGTTT TCCTGACGCC GGAGAAAAGA CCCTTTTATG CCGGCACCTA TTTCCCCAAG CACAACCGGT ACGGCAGGCC CGGCCTGGTG GAGCTTTTAA AGTTGATCCG GGAAAAATGG GCGACCCACC GCGAGGAGCT GGAAGAATCC GGCGCGGAAT TGATACAACA CGTGGCCGGC CAATTTGCCC CTACGCCGCC GGGAGAACCC GGGGCTCAGG TCCTGGAAAA GGGCTGGCAA CAACTCCGGG CCGGTTTCGA CCCTCTCTAT GGCGGTTTCA GTGAAGCTCC TAAATTTCCC AGCCCCCACC AGCTTTTGTT TTTACTGCGT TACTGGAAAA GGTATGATGA AGCGGGCGCC CTGGCCATGG TGGAAAAAAC CCTGCAGGCT ATGTACTGCG GTGGCATTTA CGACCATATC GGTTTTGGCT TCGCCCGCTA TTCTACCGAC CGCCGCTGGC TGGTGCCTCA CTTTGAGAAA ATGCTCTACG ACAACGCCCT GCTGGCCCTG GCCTACCTGG AAACCCGGCA AGCAACGGGC AAAGCTGTCT ACAGCCATGT CGCCCGGGAG ATATTCACCT GGGTGTTGCG GGACATGACC AGCCCGGAGG GAGGATTTTA CTCCGCCCTG GACGCCGATT CCGAAGGAGA AGAGGGCCGT TTTTACCTCT GGACACCCGA CCAGGTCCGG GAGGTCCTCG GTGCTAAAGA GGGGGAGTTC TTTTGCCGTT ACTTTGATAT AACCGCCGGA GGTAACTTCG AGGGGCGCAG TATTCCCAAT TTGATTGGCC GGGGAGAAGC CCTCTTTGCC GCCGGTACCA GTGGCAATGA GAGCAATGAT ACCGCTGGCG ATCAGAGGCA GCCCCGGGAG CAAGGCGGGA GAGCAGGCGG CATTTCCGGC GGCGGGGGCT GCGCTAAAGG TAGTCCGGAG GAGGACCGGC TGCCCGGCCG GGGGCCAACC ACCCTGGCCG GTTTTGGCCC GGCAACGGCG GCCCGATTGG CTGCGGCCCG CGAAAAACTC TTCGCCGCCC GGGAAAAGCG GGTCCATCCC CACCGGGACG ATAAAATCCT CACTGCCTGG AACGGGCTGA TGATCGCCGC CCTGGCCCGG GGCGCCTGGG TGCTCGATGA ACCCGCCTAT GCCGCGGCGG CCGCCAGGGC GGCCCGGTTT ATCCTAACCC ACTTGCGCGA TGCGGAGGGG CGCCTGCAGG CCCGTTACCG GGAAGGCCAG GCTGCTTTCC CGGCCTACCT TGACGATTAT GCCTTTCTCA CCTGGGGGCT CATCGAACTC TACCAGGCCA CCTTTGAGAC AGGTTACCTC CGGGAGGCCC TGGCTTTGAC GCGGCAGATG CAGGAACTCT TCCGGGACGA AGGGGGCGGC TACTTTTTTA CCCCTCACGG CGCCGGGGAA CTACCGGTCC GCCCCCGGGA AGTCTATGAC GGCGCCATTC CCTCCGGCAA TTCGGTAGCG 
GCCTTAAACC TGCTGCGCCT GGCCCGCATC ACCGGGGACA GCCGGCTGGA AGAAGAAGCC GCAGCCCAGG TGCGTGCCCT GGCCGGAACG GTGGCCGAAT ACCCCCGGGG CTATTCCTTC TACCTCTGTG CCCTGGACTT CTACCTGGGG CCGGTAACAG AAATAGTCCT GGCCGGGGAA CGGGAAACAG AAGATACCCG TGCCCTGCTC CGCGTGCTAA GGGCGGCCTA CCTGCCCTCA GCCGTCCTGG TGCTGCGTCC CGGCGGCCGG GAGGGTGAGG AAGTAACCAG GCTCATTCCC TATACCGCCG GCCAGAAACC GGTAAACGGT AAAGCCACCC TATACCTGTG CCGCAACTTC GCATGCCGGG CGCCGGTTAC GACGGCCGGA GAACTGGAGC AATGGCTAGC GTCTGCCGGA CGGGAGGCTC ATGGCGCAGG CGACACTTTT AACGAATGA
|
Protein sequence | MPDLRRPNRL IHEKSPYLLQ HAYNPVDWYP WGEEAFARAK REDKPVFLSI GYSTCHWCHV MARESFNDEE VAALLNDSFI AIKVDREERP DIDQVYMAAC QALTGSGGWP LTVFLTPEKR PFYAGTYFPK HNRYGRPGLV ELLKLIREKW ATHREELEES GAELIQHVAG QFAPTPPGEP GAQVLEKGWQ QLRAGFDPLY GGFSEAPKFP SPHQLLFLLR YWKRYDEAGA LAMVEKTLQA MYCGGIYDHI GFGFARYSTD RRWLVPHFEK MLYDNALLAL AYLETRQATG KAVYSHVARE IFTWVLRDMT SPEGGFYSAL DADSEGEEGR FYLWTPDQVR EVLGAKEGEF FCRYFDITAG GNFEGRSIPN LIGRGEALFA AGTSGNESND TAGDQRQPRE QGGRAGGISG GGGCAKGSPE EDRLPGRGPT TLAGFGPATA ARLAAAREKL FAAREKRVHP HRDDKILTAW NGLMIAALAR GAWVLDEPAY AAAAARAARF ILTHLRDAEG RLQARYREGQ AAFPAYLDDY AFLTWGLIEL YQATFETGYL REALALTRQM QELFRDEGGG YFFTPHGAGE LPVRPREVYD GAIPSGNSVA ALNLLRLARI TGDSRLEEEA AAQVRALAGT VAEYPRGYSF YLCALDFYLG PVTEIVLAGE RETEDTRALL RVLRAAYLPS AVLVLRPGGR EGEEVTRLIP YTAGQKPVNG KATLYLCRNF ACRAPVTTAG ELEQWLASAG REAHGAGDTF NE
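
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any check. The sketch below is an illustration rather than part of the record: it strips the spacing, then checks the start and stop codons using only the first and last blocks of the gene sequence. The helper names are hypothetical, and the codon sets are a small subset of commonly cited start codons and the standard stop codons for translation table 11, not the full table.

```python
def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


# Assumption: a subset of start codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}
# Standard stop codons, also used by translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First and last blocks copied from the Gene sequence field.
first_block = clean("GTGCCCGATC")
last_block = clean("AACGAATGA")

starts_ok = first_block[:3] in START_CODONS
stops_ok = last_block[-3:] in STOP_CODONS

print("start codon:", first_block[:3], "valid:", starts_ok)
print("stop codon:", last_block[-3:], "valid:", stops_ok)
print("GC fraction of first block:", gc_fraction(first_block))
```

The gene begins with GTG rather than ATG, yet the protein begins with M, because an alternative start codon is read as the initiator methionine. Applying `gc_fraction` to the full cleaned gene sequence is how one could compare against the GC content field.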