Gene Information |
Locus tag | Moth_1314 |
Symbol | |
ID | 3831801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | - |
Start bp | 1357702 |
End bp | 1359018 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 637829250 |
Product | hypothetical protein |
Protein accession | YP_430170 |
Protein GI | 83590161 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0407] Uroporphyrinogen-III decarboxylase |
TIGRFAM ID | |
|
|
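The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon that the protein length does not. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1357702
end_bp = 1359018
gene_length_bp = 1317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive on both ends.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1359018 - 1357702 + 1 = 1317 bp, which is 439 codons, or 438 residues plus a stop.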
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.42967 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCGG AAGAAAAGCA GGAGCAAATG TTCGCCTCAT GGCTTTCGGG ACGGGGCTTC CCGTTTGTTA GCCCGGAAGC AGAAAAGGGT TATAAAGAAA GGGTAAGCCG GCTGAAATCC GCTATTCAAT TGCAGAAGGT GCCGGATCGA GTGCCGGTAT GCCCAATTAC CGGTTTTTTC CCTGCTTACT ACGGCGGAAT GACGGTTAAG GAAGCAATGT ATGATTATGA TAGAGCGGCA GCGGCCTGGA AGAAATATGT TCTGGATTTT GCACCGGATG CCTATTTAGG CCTCTTCCTG GATCCGCCGG GCAGATTTTT TGAAATCCTC GACTATAAAT TATATAAATG GCCGGGTCAC GGTACCCCTC CGGATACATC TTATCAGTGC CACGAGGCTG AGTACATGAA GGCCGATGAA TATAATGCAT TGATACAAGA TCCCTCCGAT TTTTGGGTTA GGATTTACCT TCCCCGCATC TTTGGCGCAC TTAAACCCTT AGAGAAATTA GCCCCCCTCA CCGACGTCGT GGAAATAGTG ATGACCTGCG GCAGTTTTCT GCCCTTCGGG TTGCCCGAGG TCCAGGAGGC TTTAAGGGCC CTTATTGCAG CGGGTAATGA GGTTCGTCGC TGGGCGGGCA TGCTGCGCGC CGTGGATCAA GCGCTGCAAG GGCAGGGGTT TCCTGCCTTT CTTGGTGGGG TTAGCAAGGC TCCCTTTGAT ATCATCGGCG ACACCCTTAG AGGCACTTAT GGGATCATGC TGGATATGTA TAGACAACCA GAGAAACTGC AGGAGGCCCT TGAAGCCGTT ACCCCCCTCG CCATAAGGAT GGGGGTGGCT TCGGCCAAGG CTGCCGGCCA TCCCTTGGTA TTTATGCCCC TCCATAAAGG TGCTGACGGT TTTCTGTCAG ATAAGCAGTA CAGGACTTTT TATTGGCCAA CGTTAAAGAA AGTTATTACA GGCTTAATTG AAGAAGGGCT AGTACCCTTC CTGTTTGCCG AAGGCAGCTA TAATTCCCGT TTAGAAGTCA TTCGCGATTT ACCGAAGGGA AAAACAGTAT GGCTGTTTGA TGATACCGAT ATGGCCAGAG CTAAAGAAAT TCTGGGAGAT ATAGCGTGTA TCGCCGGGAA TGTACCCATT ACCCTGTTAA GCCTTGGAAC ACCGGAAGAA ATAAAAGAAT ACTGCAAGCA GCTGATAAAA ACTGCCGGTA AGGATGGAGG TTTTATACTG AGCTCTGGAG CGACAATCGA TAATATAAAG CCGGAAAACC TGCGGGCGTT AATTGAGTCT GCCAGGGAGT TCGGCAGCTA CTCCTAA
|
Protein sequence | MTPEEKQEQM FASWLSGRGF PFVSPEAEKG YKERVSRLKS AIQLQKVPDR VPVCPITGFF PAYYGGMTVK EAMYDYDRAA AAWKKYVLDF APDAYLGLFL DPPGRFFEIL DYKLYKWPGH GTPPDTSYQC HEAEYMKADE YNALIQDPSD FWVRIYLPRI FGALKPLEKL APLTDVVEIV MTCGSFLPFG LPEVQEALRA LIAAGNEVRR WAGMLRAVDQ ALQGQGFPAF LGGVSKAPFD IIGDTLRGTY GIMLDMYRQP EKLQEALEAV TPLAIRMGVA SAKAAGHPLV FMPLHKGADG FLSDKQYRTF YWPTLKKVIT GLIEEGLVPF LFAEGSYNSR LEVIRDLPKG KTVWLFDDTD MARAKEILGD IACIAGNVPI TLLSLGTPEE IKEYCKQLIK TAGKDGGFIL SSGATIDNIK PENLRALIES AREFGSYS
|
| |
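The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute GC content, and translate the gene sequence. It assumes the gene sequence is given 5' to 3' on the coding strand (it begins with ATG and ends with TAA, although the gene lies on the minus strand of the replicon). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so the standard mapping is used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases are used to keep the example short.

```python
from itertools import product

# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments; it differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGACGCCGG AAGAAAAGCA GGAGCAAATG"
print(translate(prefix))  # MTPEEKQEQM, the first block of the protein sequence
print(round(gc_fraction(prefix), 2))
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows the reported protein sequence and GC content to be compared against the record.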