Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1647 |
Symbol | |
ID | 3830935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1682950 |
End bp | 1683987 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637829572 |
Product | hypothetical protein |
Protein accession | YP_430492 |
Protein GI | 83590483 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0171139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCAT CACGGGCATT GCGTATCTGG CGCCTGGCCG GGGCCGGCAT CTTCCTGGCC GGGGCCTTGT TTTTTCTCTA CCGGGTGCGC CAGGTCCTGA CGCCCTTTAT CCTGGCAGCC CTATTGGCTT ACCTGTTGAA ACCGGCCGTG CTGGCCCTGG AAAAGAGGGG TGTTAAACGT CCCCGGGCCA TTTTAATCCT CTACCTTTTT ATCCTGGCCC TGTCCCTGCC GGTATTCTTC TTCGTCTTAC CGCAACTGGT ACGCGAATTA AATGAATTCA TCGCCCAGCT ACCTTCCTTT ACGGTGGAAA TAGAAGGCCT GGTCCAGGGC TTTTACCAGC GCTACCACCA GGTGGCCCTG CCCGCCGGCC TGCGCCGGCT GGTGGACGAC TCGATAACGA ACGTCAGCAG TGCCCTCCAG GAGGGTGCCC GCCACGCCGT CCAGGCCCTG ATCGATTTGC TGGCAGGGTT GGCCAGTTTT CTCCTGGCAC CGGTCCTTGC CTATTATCTG CTGCGGGACA GTGAGCAGAT CGGCCGCGCC GCCAGCCACC TGCTACCCAT CCAGGTGAAG GAGGACATCC TGGGACTATG GGCGGAGATC GACCAGGTAC TGACCAGCTT TATTCGCGGC CACTTGCTGG TATCCCTCAT TGTCGGATGC CTCACGGGGG TGGGACTGGC CCTGACTGGT TCCGAGTACG CGGTAATCCT GGGGGTTGTG GTCGGTCTGG CTGACTTAAT CCCCTACTTC GGTCCCCTCA TCGGCACCGT ACCCGTTATA GCCCTTTCCC TGCTGGTATC CAAAAAGGCG GCCATCATGG CCCTGGCTGT AATGCTGGTC GTCCAGCAGA TTGAGGGCAG CTTTCTGGCC CCCAGGATCC TGGGGACCAG CGTCGGCCTG CACCCTTTAA TTATCATTTT TGCCCTCCTG GCCGGGGGTG AGCTCTGGGG TGCAGCCGGC CTCATCCTGG CCGTACCCCT GACGGCCATC GGCTATATTT TAGTGAAATT CATTTGGGCC CGCCTGGTAA GCAGTTAA
|
Protein sequence | MTASRALRIW RLAGAGIFLA GALFFLYRVR QVLTPFILAA LLAYLLKPAV LALEKRGVKR PRAILILYLF ILALSLPVFF FVLPQLVREL NEFIAQLPSF TVEIEGLVQG FYQRYHQVAL PAGLRRLVDD SITNVSSALQ EGARHAVQAL IDLLAGLASF LLAPVLAYYL LRDSEQIGRA ASHLLPIQVK EDILGLWAEI DQVLTSFIRG HLLVSLIVGC LTGVGLALTG SEYAVILGVV VGLADLIPYF GPLIGTVPVI ALSLLVSKKA AIMALAVMLV VQQIEGSFLA PRILGTSVGL HPLIIIFALL AGGELWGAAG LILAVPLTAI GYILVKFIWA RLVSS
|
| |