Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_2145 |
Symbol | |
ID | 3833145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 2245416 |
End bp | 2247305 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637830068 |
Product | hypothetical protein |
Protein accession | YP_430978 |
Protein GI | 83590969 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0965405 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTGGG TTTGCAGATA CAGGTTATTT ATAGTGCCTT TATTGGTTTT CTTCCTTGTT TCGGGTTTTG GCCTTGCGTG GGGCGCGGAT GAAACTAGTA CAATGCAGAG TCCGGAAGTT GAATGGGAAA AGACCCTCGG AAAAGGGATA GGTTATTCCG TCCAGCAGAC ATCGGATGGT GGCTACATTA TTGTAGGTTC CACACAATTT CGCGGCGCTG GCGATGTTTA TCTAATCAAG ACTGATGCCA ATGGCAATAA GCTTTGGGAA AAGACTTTTG GTGGAAGCGG TTCGGATGAA GGTTATTCTG TCCAGCAGAC GACCGATGGC GGCTACATTA TTGCAGGTTC CACGCATTCT TACGGTGGCG GTGACGATGA CGTATACTTG ATCAAGACTG ATGCCAACGG TAACAAGCTC TGGGAAAAGG TTTTTAAGGG AGAAGAGCTA ATTGAGGTCA AAGGCGGAAT AGCCCGGATA AAGACCTTAA AGGGAGAGCT GATTAAAGAG ATCGATATCA CCAAGGATTG GGAAAAGTAT TGGCCAGAGA CGTTAGGGGC AGAGCTGATT AACGAGGATA CTACCAACAA TTACTGGCTA TGGAAAAAGA CTTTAGGAGG AAAAGGGCGT TCCGTCCAGC AGACGGCCGA TGGGGGTTAC ATTATTGCGG GTTACACAAA CACTTACAAC GTTTATCTGA TCAAGACTGA TACTAATGGC GACACGCTTT GGGAAAGGAT CTTTGGGAGT AATTATACTG AAGTCTATTC CGTCCAGCAG ACGACCGACG GTGGCTACAT TATTGCAGGT TACATAGACC CTGGTAGTGT CGGGAAGGGT AACGTTTACC TGATCAAGAC CGACGCTAAA GGCAACATGG TCTGGGAGAA GACTTTCGGG GGAAGTAATT GGGATAAAGG CTATTCCGTC CGGCAGACGA CCGACGGTGG CTATATTATT GCAGGTTTCA CGCGCTCTTA CGGTGTCGGT AACGATGACG TATACTTGAT CAAGACTGAT GCCAACGGTA ACAAGCTCTG GGAAAAGAAC CTTGGGGGAA ATTATTGGGA GGGAGGCTAT TCCGTCCAGC AGACGACCGA CGGCGGCTAC ATTGTTGCAG GTGTAGGCGA TTATTCTCAG ATCAAGACCG ATGGCGACGG TAACTTGCTC TGGAAAAAGA CCTTAAGAGG GGAAGGACGT TCCGTCCAGC AGACGACCGA CGGTGGTTAC ATTATTGCGG GTTACACATT CTCTCGCAGT ACCGATAGTG ATGTTTACTT GATCAAACTT AAACCCGAAA CTCCCCCCGC AAACCAGCCT CCAGTGGTAA GTTTAAAGGA TATGCAGGGC CACTGGGCGG CCGACGCGGT GGACAGGCTG GTTGAGACGG GGGTTGTCTC CGGTTACCCA GACGGGACTT TCAGGCCCGA CCTGGAAGTG ACCCGTGCCG AAATTGCGGC TATCTTGGTG CGCGCCCTAA AGCTCACACC AACCAACAAT CAGGAGCTAA AGTTCAAGGA TGATGCAACC ATCCCGACCT GGGCCAAGGA CGCGGTAAGT ATAGCGGTTA AGGAAGGCCT GGTTAAGGGC TACCTTCAGC CGGATGGGAC AATGACCTTC GAAGCCGACC GCCCCGTCAC ACGAGCAGAA ATGGCTGTAT TAGTGGCGCG CGTCCTCCGG AAAAAACTCG GGGAGGTCAC CCCGATGGAG CTTAAATTCA CCGACGCTGT CATGATCCCG GCTTGGGCCA AATCGGACGT CGGCGTTGCT GTGGCGGAAG GCATCGTTGT CGGGTATCCC GACAATACCT TCCGTGCAGA GAACCATGTC ACCCGTGCGG AGGCTGCGGT AATGATCCTG CGGCTCCTAA GGGTGCTTGG CAGAATATAA
|
Protein sequence | MFWVCRYRLF IVPLLVFFLV SGFGLAWGAD ETSTMQSPEV EWEKTLGKGI GYSVQQTSDG GYIIVGSTQF RGAGDVYLIK TDANGNKLWE KTFGGSGSDE GYSVQQTTDG GYIIAGSTHS YGGGDDDVYL IKTDANGNKL WEKVFKGEEL IEVKGGIARI KTLKGELIKE IDITKDWEKY WPETLGAELI NEDTTNNYWL WKKTLGGKGR SVQQTADGGY IIAGYTNTYN VYLIKTDTNG DTLWERIFGS NYTEVYSVQQ TTDGGYIIAG YIDPGSVGKG NVYLIKTDAK GNMVWEKTFG GSNWDKGYSV RQTTDGGYII AGFTRSYGVG NDDVYLIKTD ANGNKLWEKN LGGNYWEGGY SVQQTTDGGY IVAGVGDYSQ IKTDGDGNLL WKKTLRGEGR SVQQTTDGGY IIAGYTFSRS TDSDVYLIKL KPETPPANQP PVVSLKDMQG HWAADAVDRL VETGVVSGYP DGTFRPDLEV TRAEIAAILV RALKLTPTNN QELKFKDDAT IPTWAKDAVS IAVKEGLVKG YLQPDGTMTF EADRPVTRAE MAVLVARVLR KKLGEVTPME LKFTDAVMIP AWAKSDVGVA VAEGIVVGYP DNTFRAENHV TRAEAAVMIL RLLRVLGRI
|
| |