Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1905 |
Symbol | |
ID | 3831178 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1969828 |
End bp | 1970853 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637829838 |
Product | hypothetical protein |
Protein accession | YP_430748 |
Protein GI | 83590739 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000000983777 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCC CCGGTAATTG TTTAACAACG GCCATGGGTA TTCTTCCCCA TGTCGATCCG GATACAGCCT TGAGACTGGC CCTGTCCCTG GATATACCCT TCTGGCCGCA ACTACCCAGA CTGAGCTTTT ATGAAGATAT GTATGCCCAG GTATCAGAGC ACCTACCCGG CATTAAACTG GATTTTAACG CGCAGGTTAT CCGCTTCAAT ATAAAAGACT TTTACGAAGA ACTTCCTGCT TATATCGAGA ACTGGGAAGA TCCGGATTAC TTTCGCCTCT CACCCAGGTA CGCCAGGGTT TACCGGCGTT TTTTGGCGGA AGATTTAGCT CACTATCCCG CCATTCGCGG CCAGTCAATC GGCCCCGTTA GCTTTGGTCT CAAAATAGTT GACGAAAAAC AGGCGCCGAT CATCTATAAC GACGAAGTCA GGGGGTTCCT CTATGATTTT ATAACTAAAA AAATCCAGGC CCAGTACCGG GACCTGGTAG CCAAAAACCC CCGGGCTTTT GTCTGGGTAG ATGAACCGGG CCTGGAACTG GTCTTTATGG CCCTTACCGG CTACAGTTCG GAAAGAGCCA GGGAGGATTA CCGCCATTTC CTGGCCATGC TCCCCGGACC TAAAGGGGTA CATCTCTGCG GCAATCCCGA CTGGTCCTTT TTGCTGGGCC TGGAACTTGA CATTATCTCC CTCGATGCCC TGCAGTGGGG GCACATTTTT ACCCGTTACA CCGGAGAAGT AAAAGAATTC CTGCAAAGGG GCGGTATTAT CTCCTGGGGC ATCACCCCCA CCCTGACCGA AGAGGTAGAA AAGGTAACGA TTGCAAAGCT GGTAGCCCAA TTGGAAAATC TCTGGGATTA TTTGCACGGG CAGGGGATTA GCAAAGAAAC TATTATAACC CAGGCCTGGC TGGCGCCGGC CCGCTGCTGC CTGGTAAATG CCGACGGCGC CGCTTCCGTC GAAAAATCCT TCCGGCTGCT CAAAGAAGTT GCCGGGGTAA TCAGGAAGAA ATACGGGTTA TTATAA
|
Protein sequence | MKLPGNCLTT AMGILPHVDP DTALRLALSL DIPFWPQLPR LSFYEDMYAQ VSEHLPGIKL DFNAQVIRFN IKDFYEELPA YIENWEDPDY FRLSPRYARV YRRFLAEDLA HYPAIRGQSI GPVSFGLKIV DEKQAPIIYN DEVRGFLYDF ITKKIQAQYR DLVAKNPRAF VWVDEPGLEL VFMALTGYSS ERAREDYRHF LAMLPGPKGV HLCGNPDWSF LLGLELDIIS LDALQWGHIF TRYTGEVKEF LQRGGIISWG ITPTLTEEVE KVTIAKLVAQ LENLWDYLHG QGISKETIIT QAWLAPARCC LVNADGAASV EKSFRLLKEV AGVIRKKYGL L
|
| |