Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_0050 |
Symbol | |
ID | 3830800 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 50335 |
End bp | 51303 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637827982 |
Product | hypothetical protein |
Protein accession | YP_428932 |
Protein GI | 83588923 |
COG category | [S] Function unknown |
COG ID | [COG3584] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000119659 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTTTCTGC TGCGACAGCC TTCTGCAGGC CGGGGTCGTA AACTGGTGCT CCTGGCCTGC CTGGTGGCGG GGTTACAATC CCTCTTCCTG GTGGGCTGGA CCAGTAAAAA GGTTGATATC TACGCCGATG GTCAGGCCAG GCAGGTGACC GTTAACCAAT GGCTGGTAGG CGATTTCCAG TCCCGAAGCC AGGAAAACTT AAGGATTGGC GACTGGATTT TACCCCTTCC AGGAGGCTGG TTCTGGCCCG GCTTGAAGCT GTTTATGGCC AGGGGTACGC CTGTGGCGGC CGAGGTTGCC GGCCAAAATA TCTGGCCCCG GGAACCGGCC TCTGTGGCCA GTGAGCTCCT GAACAGGGAG GGTATAACCC TGGGGCCCGG GGACAGGGTG GAGACCAACC TGGGAAGCGA AGACCCCCAC CAGTACATCC GGGTAATTCG CGTCGAGGAT AGTATCGAAG TACAGCAGCA ACCAGTAGAC CCGCCGGTGG TGCGCCGGCC AGACCGTTAC CTGCCTCCGG GCCAGGAAAA GGTGGTACAG GAGGGGCAAC CAGGTGTCCG CTATTACAAA TACCAGGTGC GAAAAGAGAA TGGCGTTGAA GTCGAGCGCC GGCTGGTAGA CACGTGGGTT GAAATCCAGC CCAGCCCCAA AATAGTTGCC TACAGCAGCA GGGCTTATCC AGAGGTTACC GCCCGGGCCG GGGACACTTT GATGGTAATT GCGACGGCCT ATACCCATAC AGGCAACCGG ACGGCCACCG GGATCTGGCC CTATCGGGGC GTTGTTGCCG TAGACCCGCG GGTTATCCCC CTGGGGACCC GGCTCTATGT AGAAGGTTAT GGCTATGCCG TTGCCCAGGA TACCGGTGGG CTCATTAAAG GCAAGCGCAT CGATCTCTTT ATGGATAGCG CCGGGGAGGC GATGCGCTGG GGCCGGCGAC AGGTAACCGT TCGCATTCTC GGCGATTAG
|
Protein sequence | MFLLRQPSAG RGRKLVLLAC LVAGLQSLFL VGWTSKKVDI YADGQARQVT VNQWLVGDFQ SRSQENLRIG DWILPLPGGW FWPGLKLFMA RGTPVAAEVA GQNIWPREPA SVASELLNRE GITLGPGDRV ETNLGSEDPH QYIRVIRVED SIEVQQQPVD PPVVRRPDRY LPPGQEKVVQ EGQPGVRYYK YQVRKENGVE VERRLVDTWV EIQPSPKIVA YSSRAYPEVT ARAGDTLMVI ATAYTHTGNR TATGIWPYRG VVAVDPRVIP LGTRLYVEGY GYAVAQDTGG LIKGKRIDLF MDSAGEAMRW GRRQVTVRIL GD
|
| |