Gene Information |
Locus tag | Moth_0748 |
Symbol | |
ID | 3831140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 784776 |
End bp | 785942 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637828679 |
Product | hypothetical protein |
Protein accession | YP_429609 |
Protein GI | 83589600 |
COG category | |
COG ID | |
TIGRFAM ID | |
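The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of the record.
start_bp, end_bp = 784776, 785942

gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1167 388
```

Both results match the Gene Length (1167 bp) and Protein Length (388 aa) rows.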
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGATATCG TAAACTTCCA AATAATGTAT GTTACTACAC CGGTACCGGG GCTTACCTAT GATGTACCTT CTCTTCCTGC CTCGGGGAAA ATAGAGGCCT TTAATAGGAC TGTCACCCTG ACGTACCCCA GGGGAGACGT CCTGGTCGAT GGCAGCGGTC AAATTACGAA TAGCTCGGTT GCCTTTACGG TTTACGACCC TCCCACCACC AGGCCGGATA ATTACCATTT CCTGGCCAGC CAGGTCATAC AAATTTCCGT TCCCCAGGCC GTTTATCTGT TACAACCCGG GCAGCTCACC CTCAGTTATG ATCCGAATAT CAGCGCGGCT GTGGCCGACC AGCTGGCCAT CTGGTACAGC CCGGATAATG ATTGGAGCGA TGATGATAAC TATATCCTGG GCGGCCGTAC TGATGTCCGC AATCACACTG TGACCGCCCC CTTCCAGTTC AATGGTAAGA CAGAAGGCTA TTATGCCGTT TTCCTGGCGG ACCGTGTATT TAAAGATTTT ACCAGCCAGT CCGAAGCAGC CTGGGCCTAT TCCAGCGTAA TGCCTATGTG GGCTAAAGGG CTGGTTGAGC CCCTACCAAT CGATAGAACA GATAATAACT TCGGGGCAGG AAACAATATC AACCGCCTGG AGTTTACCAC CATACTCATC AAAGGGCTGG GATTACCGAT AATAGAAAAA ACATCTTCGG TTTTTAGTGA TGTATATGCA ACTCCAGAAG GGAATAATGG CTCATATTTT GGTAGCAATT ACTCATCAGC AAATGGGTTT AAGATATACG ATCAATCTCA TACACCGGTA CAATACGCCG AAACCGCCGC CCGATATGGT ATTATAGCAG GATATCCTGA TGGCACTTTC AAACCAGCAA ATAGTCTAAC CCGCCAGGAA GCGGCCGTCA TTTTAGCCCG GGCCGCCAAC TTGAAGTTGC TTAGCGACGG TGATAAAGCC AAGACGGATC TGGAAAGGAT TTTTACAGAT GCCTCTGCTA TTCAGCCGTG GGCGGCACCT TCAATACTGG CGGCGTACCG CGCCAGGCTG ATTGCCGGCC TCCCCGATGG AGATAAGAAG ATAAAGTTTG CTCCGAACGA TCCTCTCACC AGAGCCCAGG CGATTACCAT GGTTTACCGC CTGCTGGAGA AGCAAAAAAA GATATAG
Protein sequence | MDIVNFQIMY VTTPVPGLTY DVPSLPASGK IEAFNRTVTL TYPRGDVLVD GSGQITNSSV AFTVYDPPTT RPDNYHFLAS QVIQISVPQA VYLLQPGQLT LSYDPNISAA VADQLAIWYS PDNDWSDDDN YILGGRTDVR NHTVTAPFQF NGKTEGYYAV FLADRVFKDF TSQSEAAWAY SSVMPMWAKG LVEPLPIDRT DNNFGAGNNI NRLEFTTILI KGLGLPIIEK TSSVFSDVYA TPEGNNGSYF GSNYSSANGF KIYDQSHTPV QYAETAARYG IIAGYPDGTF KPANSLTRQE AAVILARAAN LKLLSDGDKA KTDLERIFTD ASAIQPWAAP SILAAYRARL IAGLPDGDKK IKFAPNDPLT RAQAITMVYR LLEKQKKI
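The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the pipeline that produced the record: it strips the 10-character block spacing, computes a GC fraction, and translates codons until the first stop. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acids to codons as the standard code, so the sketch uses the standard mapping; it does not handle alternative start codons, which is an assumption that holds here because the gene begins with ATG.

```python
BASES = "TCAG"
# Standard codon-to-amino-acid mapping in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks and last three in-frame codons of the gene sequence above.
gene_prefix = "ATGGATATCG TAAACTTCCA"
gene_tail = "AAGATATAG"

print(translate(gene_prefix))  # MDIVNF
print(translate(gene_tail))    # KI
```

The two outputs agree with the start (MDIVNF...) and end (...KKI) of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence rows to be checked in the same way.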