Gene Information |
Locus tag | Moth_1697 |
Symbol | |
ID | 3833297 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1735263 |
End bp | 1736498 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829622 |
Product | hypothetical protein |
Protein accession | YP_430542 |
Protein GI | 83590533 |
COG category | |
COG ID | |
TIGRFAM ID | |
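
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the final codon of the CDS is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1735263
end_bp = 1736498

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)  # 1236 411
```

Both values agree with the Gene Length and Protein Length fields in the record.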
| |
Plasmid Coverage Information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.600188 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGCG CTGAACGCGG TATAGTCATG TCCAGGGAGG GGCAGAGGGT CATCGTTTTA ACGCCCCGGG GCGACTGGCG GGCTTTAAAA TTGGCCGGAC CCCTGCCGGA AGTGGGGGAA GAAATTATGC TGCCGCCGGT GGTCAAGAGG CCGGCGTGGC CCCTCTTTAC TGCCGCCGCA GTAGTTCTTT TACTGGTCCT GGCCGGGGCT GTAGTGCGCC GGATAGAAAC CCCGGCGCCG GCGGCTGCCC CTATGGTAGC CTATTATGTT AATATAGATA TTAACCCCAG CGTAGAGCTG GCCGTGGATG AAAAGGATAC CGTCCTCGAG GCCCGCGGCC TGAACAACGA CGGGGAAAAA TTACTCGCCG GCATTGCCCT GAAGGGGGAA AAGGTTACCG GGGCTATGAA GATTCTGGCC CTGGAGGCCC TGCGCCAGGG TTATTACCTT CCCGAGGGGG AAGGCGCCAT GATGGTAACA GTAATCCCTG CCGGGTCCGG CCAGGAGAAA CTGGCGGCCG GGGATGAACT GGGCCAGAGG CTCACCCGCG AAGCGCAGGA TGTCTTTCAG CAGGCCGGGG TACATGCTGC CGTAGAGGCA GCTACCGTCC AGCCGGAGAT CCGCCAGCAT GCCGAGGCCG CCGGCCTCTC GGCCGGTAAG TACAGCATTA TGCTGGAGGC CCTGGCAGCC GGGGCCCAGG TAAAGGCTGT CGATTTGCAG CGGGAGAGTA TTACTAAAGT CCTGCAGGAA CTCAATTTTA ACTGGGAAGA TGTGCTTGCC CGGCTAAAAA GGGATCCTGA CCTGTTAAAG AGGGAAGAGC AACTGGGACC GGTCCTGAAG GCGGCCCTGG GTCAGGGCCC ACTCCCCGCG GAAAACGGAA ATAGCCAGGG TAATGCTCCC GCCAGGGGTC CGGCAGCAGC ACCGGCTAAT AAGCCCGACC AGGGGGATAA TCAGGAAACC CGGCAAGGCA AACAGGAAAC CACCGGCCAG GGGAGGGAGA TGAACTCGAA CCGTGGCCAG TCCTCCAGCC GGGATGGTAT GGCAGTAGCC TGGCAGCTAC GAGCCCGGCT CAAGGCCGGA CTGCAAAACC AGCCGGGGGG ACCGGTCCTC AAAGAACTGC CGGTACTAGA ACATGTCCCC GGGAAAAACC TGCGGGACGT GCTGGCAAAA ATAAAATTGG AAGACGTGGT TTTAAAGAAA ATAGAGGAAA AACGGCAGGA TATGGCGAAA AGATAG
|
Protein sequence | MDGAERGIVM SREGQRVIVL TPRGDWRALK LAGPLPEVGE EIMLPPVVKR PAWPLFTAAA VVLLLVLAGA VVRRIETPAP AAAPMVAYYV NIDINPSVEL AVDEKDTVLE ARGLNNDGEK LLAGIALKGE KVTGAMKILA LEALRQGYYL PEGEGAMMVT VIPAGSGQEK LAAGDELGQR LTREAQDVFQ QAGVHAAVEA ATVQPEIRQH AEAAGLSAGK YSIMLEALAA GAQVKAVDLQ RESITKVLQE LNFNWEDVLA RLKRDPDLLK REEQLGPVLK AALGQGPLPA ENGNSQGNAP ARGPAAAPAN KPDQGDNQET RQGKQETTGQ GREMNSNRGQ SSSRDGMAVA WQLRARLKAG LQNQPGGPVL KELPVLEHVP GKNLRDVLAK IKLEDVVLKK IEEKRQDMAK R
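
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' in the coding orientation (it begins with ATG and ends with TAG, which is consistent with that). Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs in the start codons it permits, so the standard assignments are used here. The helper names `translate` and `gc_percent` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments; it differs in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) until the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (spaces ignored)."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence in the record.
print(translate("ATGGACGGCG CTGAACGCGG"))  # MDGAER
```

Pasting the full gene sequence from the record into `translate` should reproduce the listed protein sequence, and `gc_percent` can be used in the same way to compare against the GC content field.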