Gene Information |
Locus tag | Moth_0718 |
Symbol | |
ID | 3830994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 748280 |
End bp | 749584 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637828649 |
Product | hypothetical protein |
Protein accession | YP_429579 |
Protein GI | 83589570 |
COG category | |
COG ID | |
TIGRFAM ID | |
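The coordinates and lengths in this table are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not translated. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(748280, 749584)
print(length_bp, protein_length(length_bp))  # 1305 434
```

The result matches the Gene Length (1305 bp) and Protein Length (434 aa) fields: 435 codons, of which the last is the stop codon.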
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000121294 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0965405 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCAAAG TAAGCTTGAC GGGGCTGATC CTATTTTTTA TGTTCCTGCC CTGGGGCACG GCCTGGGCCG GCCCCACCGC CGGCGAACTC CTGGCCCTGC TGCCGCCGGG AGCTGGGCAG GCAACGGAAT TGACCCGGGG CGGTTTTGCG GTCATGCTGG CGGTTGCTGC CGGGATTAAG GGCGATGCCG AAACCGGCGA GCTGCCCGTG GATGTGCCAC CGGAAAGCTG GTACACCCCG GCGCTGCGGG CTCTGTGGCA ACAGGGGATT ATCCAGGGTT ATCCCAACGG CACCCTGCGG CCGGAGCAAC CGATCACCTC CCTGGAAGCG GTAATCCTTA CCGCCAGGGC CATGGGATTA CCCAACGAGA TCGGCGGGCG GGAAGATAAT CCCTTGCCTG GGGAGATACC TTATGGCCTT AGCCAGTACG CCTTTTTTCA GCAGCAAGGG TTGTTGCCTC CCGGCGAACC CCTCGCACCC ATGAGCCCGG CGGAAGCGGC CAGGTGGCTG GCGGCAGTCT TCGGTTCCGA AACCAGGGCG GGAAACCTTT TGGCCCGGTG CCGCCAGGTC CTGGCAGGTA AAGAGGCCAT CCGCGTCAGA GGGACAGTTA GCCTGCAGTT CTATAACCGG CCGGGGTTAC CAACTACTGC CGAATTGGAC CGCATGGTGA TTTATGGCGA CGTTCTGAAT GAGGTTAGTA TGACAGGAAA GATGCACCAG CTGGTGACCC TGCACCTGGA AAAAAAGCAG GAGATGACCA TCGAGCAATT TGTCTCAGGT GGGTACCTCT ACCGCCGGGT GACAGGCAGT GGAAGGGAAA CCGGCGAATG GCAACGTCTA TCTTTCGCCC CTGATGTTAG CCTTTTGTTG CGCCAGCAGC AGAACCTGGG CTTGCCGGCC GGCATTTTTC CCTTCCTGCA TTACCACCTC CTGGGGGAAA GGGAGATTGC AGGCCGGCAC GTGGTGGGGG TCAGCTTTTA TGCCCGGCAA AATAACCCCG GGGCCCCCGG CGACCTCCTG CCGCTCCAGG TTTTCAGCGG CAGCGTGGAC GATTATTTCA GCCAGCCGGG TAAACTTATC CGCTCCCTGT CTTACTGGGG CGTTATTTAC CTCGATTCGG AGAGCCTGCT GCCGGTAAAG ATCGACCTTA ACCTGGTGAT GGCCTTTGAG CCTGCCCCCG GCGGGCAACC GGCAGTTATG GCCGCCATGG AGGCCCGTTT TCAGGGGAAG GACTACAACT TTGACGATTT TAAAATAGAG TTACCGGCGG CTGCGGTGGC GGCGCCAGTA AAAGAAAACC AGTAG
Protein sequence | MRKVSLTGLI LFFMFLPWGT AWAGPTAGEL LALLPPGAGQ ATELTRGGFA VMLAVAAGIK GDAETGELPV DVPPESWYTP ALRALWQQGI IQGYPNGTLR PEQPITSLEA VILTARAMGL PNEIGGREDN PLPGEIPYGL SQYAFFQQQG LLPPGEPLAP MSPAEAARWL AAVFGSETRA GNLLARCRQV LAGKEAIRVR GTVSLQFYNR PGLPTTAELD RMVIYGDVLN EVSMTGKMHQ LVTLHLEKKQ EMTIEQFVSG GYLYRRVTGS GRETGEWQRL SFAPDVSLLL RQQQNLGLPA GIFPFLHYHL LGEREIAGRH VVGVSFYARQ NNPGAPGDLL PLQVFSGSVD DYFSQPGKLI RSLSYWGVIY LDSESLLPVK IDLNLVMAFE PAPGGQPAVM AAMEARFQGK DYNFDDFKIE LPAAAVAAPV KENQ
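The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The sketch below shows one way to strip the spacing, compute GC content, and check that a CDS begins with a start codon and ends with a stop codon. The helper names are hypothetical. The start codon set is only the commonly used subset for translation table 11 (an assumption); the stop codons are the three standard ones.

```python
def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def has_start_and_stop(
    seq: str,
    starts=("ATG", "GTG", "TTG"),  # assumed common subset for table 11
    stops=("TAA", "TAG", "TGA"),
) -> bool:
    """True if the length is a multiple of 3 and the end codons look like a CDS."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in starts and s[-3:] in stops


# Toy input, not the gene above: paste the full Gene sequence value to check it.
toy = "ATGGCC TAG"
print(has_start_and_stop(toy), round(gc_percent(toy), 1))
```

Applied to the full gene sequence, `gc_percent` should land near the GC content field reported in this record, allowing for rounding.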