Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2315 |
Symbol | |
ID | 3831067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 2437566 |
End bp | 2439266 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637830239 |
Product | hypothetical protein |
Protein accession | YP_431145 |
Protein GI | 83591136 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.326994 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAAAC GGTTATTGTT ATTTTTTATC GTTTTCTGCT TTATGTTAAC TCTCTCCGGT ACGGGGAGCA GGCCCGAGGG CTGGCCCCTG ACAGCGCCGG TGGTGCAGGC GGATATTGTT GTCTATGGCG GCGGCCTGGC CGGGTGCGCG GCGGCCTGGA AAGCCGCGGT CGCGGCGCCG GACAAGCAGG TGGCCCTGGT GGTCCCCTAC CCCGGCCGCG AGTACGGCGG CCTGGCTACG GTGGGCGGCC AGAACTTCTG GGACCTGCGC TACTGGGACC GGGACGGGAA GCTGGCCCAG GGGGGCTCAT TTGCCCACTG GTTGAAAGCG GTAGGTCCTT TTTACCGCAC GGCAGATCTT GCGGCGCGGA TTGCCGCGGA CCTGGGAAGG CTGCCGAACC TCAAAACTTA CTGGGCTATG GACATAACCG CCGTCCGGAA GGACCGGCGT GGCCGCCTGC GCGCCCTGGC ATTGCGGGAA CTGCAACGTG ATGCGACCGG GACCATCGCC TGGGGTAGCG GGAAAGAAAT CCTGGCGGCG CCGATATTTG TTGACGCCTC CGAGGACGGC AGGCTCAGCC GCCTGAGCCA GGCCGGGGTG ACGGTGGGCC GGGCGGACTG GCCGGCCGAA CTCCTGACCA GAGATTTTCT GAATGGCAGC AGCGACCTCC GCCCGCGCCA GCAAGCGGCT ACTTTGATGT TTAAGGTGAA GGGGGTTCGG CCGGGCCGCT ACCCGGATAT GGTTTTTCAA CGAGCAAAAG GGGTATGGGG GGCCTACGGC GGTAAAAAAG TTTACATAAG CGACCCGGTG GTTACCGCTT TTAACGATAA ATACGGCCCC CTGGGCTTTG CCCTGAAACC CCTCAACGCC GCCCAGGACG GGCCGGGAAG CCCGGAGTGG TGGGTTAATG CCCTCCTTAT CTTTAACGTC GACGGCCGGG CCAACGGCCG CGACCGGGGG CATGATGCCT ATCCCGGGGA TATGGCGCCA GGGGCCCTGG ACACGGATAC GGCCTGGCAG CGGGCGCGCC AGTTACTGGA CGACCCGGAG TTCATCCGGG CCTTGCGCCG TTTTCAAGGG TTCGAAGAGG CCGAGGTGGT GCGGGATGCA GCAGGCCGGC CGGTGACCGG CGATATCCTC TACCTGCGGG AGACAGTGCA CACGGTGATG GACCCCAGGG AAACCCGTCC GGGGACTGAG AATAACAATT ACGCCCTGAA CGCATCGGCT GCCCGGGGCG CAGGTTCTGG CCCCCCTGAT GGCGATGATC TCGGGAATTA TGCCAACCGT ATCGGCCTGG GCTTTTACTG GCAGGATATC AATGCCTATC ATTTCAGCGA TCTTAAAGGG AGCGACGGCA GCTACCGCTG GCCGGTGACG CCCTTTTTGC GGCCCGATTA TCCTCGGACA ACGCCGGGAC CAGACGGATG GCCACAAAAT CCGGTGTATA TTCCCTTTAA CGTTCTCCTT AGCCGGACGG TGCCCAACCT CCTTATCCCT GGTTATGCAG CCAGCATCTC CTCCCTGGCC TGGGCCGAGC TGCGGGTGCT GCCCAATAGT TGCGTCCTGG GAGATGCGGC CGGGGTAGCG GCTGCCTATG CCGCCCGTGT GGGCCGCGAC CCGGGCACCT TCACTGACGC CGATGTAGCG GCTATCAGGG AAATACTGGT CAAAAGTTTC GGCGCCAGGG TAGATAAGTG A
|
Protein sequence | MGKRLLLFFI VFCFMLTLSG TGSRPEGWPL TAPVVQADIV VYGGGLAGCA AAWKAAVAAP DKQVALVVPY PGREYGGLAT VGGQNFWDLR YWDRDGKLAQ GGSFAHWLKA VGPFYRTADL AARIAADLGR LPNLKTYWAM DITAVRKDRR GRLRALALRE LQRDATGTIA WGSGKEILAA PIFVDASEDG RLSRLSQAGV TVGRADWPAE LLTRDFLNGS SDLRPRQQAA TLMFKVKGVR PGRYPDMVFQ RAKGVWGAYG GKKVYISDPV VTAFNDKYGP LGFALKPLNA AQDGPGSPEW WVNALLIFNV DGRANGRDRG HDAYPGDMAP GALDTDTAWQ RARQLLDDPE FIRALRRFQG FEEAEVVRDA AGRPVTGDIL YLRETVHTVM DPRETRPGTE NNNYALNASA ARGAGSGPPD GDDLGNYANR IGLGFYWQDI NAYHFSDLKG SDGSYRWPVT PFLRPDYPRT TPGPDGWPQN PVYIPFNVLL SRTVPNLLIP GYAASISSLA WAELRVLPNS CVLGDAAGVA AAYAARVGRD PGTFTDADVA AIREILVKSF GARVDK
|
| |