Gene Information |
Locus tag | Moth_1807 |
Symbol | |
ID | 3830725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1860063 |
End bp | 1862135 |
Gene Length | 2073 bp |
Protein Length | 690 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637829734 |
Product | hypothetical protein |
Protein accession | YP_430650 |
Protein GI | 83590641 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
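The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the record for Moth_1807.
length = gene_length_bp(1860063, 1862135)
print(length)                     # 2073
print(protein_length_aa(length))  # 690
```

Under these assumptions both results agree with the Gene Length (2073 bp) and Protein Length (690 aa) fields.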
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000440945 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTAATA TAGTTACGCT GGTGAAAAAA GATAACGCTG GCAAATTCGA CTGGGCTCCG TTGTACGAGG CTTACGTCAG CGGCGCGAAC GGCGACCCGC TAAACGAACG CGATGATGGT TGGATGGATG GGTGCTGTCC TCTACATGAC GACACCAGAC CGAGCTTCTC TTTTAACCGC TGGTCCGGGT ACTGGCTCTG CCGGGCAGGG TGCGGCGCGG GTTGGCCGCT GGACTTTTTG GAGGCGGTCG CTGGCCTCGA CGGTGACGAT GCACTAGAGG AAATCCGGGA ATTGTGTGGG TCCTTGCCGC CGGAGATAAT CCCCGGCCCG CTTACCTTGA CGGCCTACGC TGTCTACTGT AACCTACCGG TATCCTTCCT TCAGAAGTTG GGACTGGCGG AATGTGCTAA AGGCGTCAAA ATACCCTACT GCACCGTGGA CGGCCAGGTT TTCCGCTACC GTTACCGCCT GAGTTTGAAT AAACAGGGCA GCCGGTTCGC TTGGGGTAGC GGCAAGGGTA TCCTGCCATA CGGTTTGGAG AACTTGCATT TGGCTAAGAA GGCAGGTTAT TTACTCCTGG TCGAGGGAGA ATCTGATGTG CAAACTCTGT TGTATGCTGG TATTCCCGCT CTAGGAAGTC CCGGAGTTGC TGCGTGGCAG AAGGAATGGT CAGCTTTAAT CCCTGAAGGA GTGCAAATTT ACATCTGGCA GGAACCGGAT GAGGGTCACG TCCTTGTGGA AAAGGTTTTA CGTGCCTTCC CGGACGCGAA AATTATTAAA ACCACTGTGG AGGAAAAAGA CCCCCGGCAG GTATGGTTGA ACTCCAGGGA TAAGGCAGAT TTCGTGCAGC TTATTAACGA GTTATTACAA ACAGCAACCA CGGCAGATGA CCTGCAGGAA GCCGCCAGGA GGACAGAGCG GGAAGCGGTC TGGCAGAAAT GTAAAGACTT AGCAATGGAG CCTGATATCT TAACACCTGT TTTAAATACC CTTAGTCAAG TCGTTGCTGG CGAACGCGAG GCCTTGGCCA TCCTTTACTT AGCGTTGACC TCACGGTTAT TACCTCGACC CATTAACCTC CTGCTTCAGG CCCCACCGGG GGCAGGCAAA AGTTATTTGG TAGACTGTGT TTTACAGATG TTTCCAGAAA GTGCATATTA TAAATTGACC GCTTCCAGTG AGCGGGCTTT TATTTATTCG GATGAAAATT TCGCTCACCG GACAGTTGTC GTAGCCGAAG CAGCAGGCTT GCATTCGGAT GGTGTCGCGG GAACAATTAT ACGTGAGTTA GTTTGGAGTA GCCAGTTGGC TTATGAAGTT GTAGAGAAAA CCCCTGACGG CCTGCGGCCA CGTAAAATTA TCAAAGATGG CCCAGTCGGC CTAATTACCA CGACTGTTAA AAATGTTGAA GGCGAGCTAG CCACACGCCT ACTGGTCGTG GAACTGAAAG ACACCCCGGA GCAAACCAGG CTAATTCTGG AAGCGGAAGC ACGGGAAGCT GCTGGACAGG CTACCATGCC AGATTTAAGC CATTTCGCGG CCCTACAGAA GTGGTTGGAG CTAAATGGGC CAGCGAATGT AATCGTACCC TACGCTGAAA CCCTAGCCAG GCTATTGAAG CCAAGCAGCG TACGGTTGCG CCGTGATTTC CGGCAGTTGT TGACGTTGAT AATGGCTAAC GCCGTGCTAC ACCGGGCGAG CCGCCAAACT TCTAGCAGTG GAGCGATAAT CGCCTCCATC GACGATTACG CGGCCATTTA CCCCCTGGCG GTTGCTCTGT TCGCCAGCAC TGGCGAAGCC 
ACCCTAACGC CGCAGCAACG TGAGGCGGTA GAAGCCGTTC GCCGGTATTA CGAGCAGTAC CATACGTCGG TCACTGTCAA GGCTTTAAGT AAATTACTGG GGATCGATAG GACTTCCACT CAACGCCGGG TAGCTGCGGC TATCAAGAAA GGTTTCCTGG TAAATCTGGA AGATAAACCC CACAGACCAG CGATGTTAGT GCCAGGCGAT ATGGCGGCTG AGGAAGACAA CTCTTTACCA GAGCCAGAGA TGGTAGCCAG GATGTCCAGC TAA
|
Protein sequence | MGNIVTLVKK DNAGKFDWAP LYEAYVSGAN GDPLNERDDG WMDGCCPLHD DTRPSFSFNR WSGYWLCRAG CGAGWPLDFL EAVAGLDGDD ALEEIRELCG SLPPEIIPGP LTLTAYAVYC NLPVSFLQKL GLAECAKGVK IPYCTVDGQV FRYRYRLSLN KQGSRFAWGS GKGILPYGLE NLHLAKKAGY LLLVEGESDV QTLLYAGIPA LGSPGVAAWQ KEWSALIPEG VQIYIWQEPD EGHVLVEKVL RAFPDAKIIK TTVEEKDPRQ VWLNSRDKAD FVQLINELLQ TATTADDLQE AARRTEREAV WQKCKDLAME PDILTPVLNT LSQVVAGERE ALAILYLALT SRLLPRPINL LLQAPPGAGK SYLVDCVLQM FPESAYYKLT ASSERAFIYS DENFAHRTVV VAEAAGLHSD GVAGTIIREL VWSSQLAYEV VEKTPDGLRP RKIIKDGPVG LITTTVKNVE GELATRLLVV ELKDTPEQTR LILEAEAREA AGQATMPDLS HFAALQKWLE LNGPANVIVP YAETLARLLK PSSVRLRRDF RQLLTLIMAN AVLHRASRQT SSSGAIIASI DDYAAIYPLA VALFASTGEA TLTPQQREAV EAVRRYYEQY HTSVTVKALS KLLGIDRTST QRRVAAAIKK GFLVNLEDKP HRPAMLVPGD MAAEEDNSLP EPEMVARMSS
|
| |
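The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is already given in coding orientation (5' to 3' of the coding strand, even though the gene lies on the minus strand of the replicon) and uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code for elongation. The `translate` helper is hypothetical, and only the first 30 bases of the record are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
# The amino acid assignments of translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
first_30_bases = "ATGGGTAATA TAGTTACGCT GGTGAAAAAA"
print(translate(first_30_bases))  # MGNIVTLVKK
```

The output matches the first ten residues of the protein sequence listed above. Applying the same function to the full gene sequence would be expected to reproduce the 690-residue protein, provided the two lines of the sequence are joined first.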