Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Moth_1597 |
Symbol | |
ID | 3832743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Moorella thermoacetica ATCC 39073 |
Kingdom | Bacteria |
Replicon accession | NC_007644 |
Strand | + |
Start bp | 1633017 |
End bp | 1634192 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637829526 |
Product | HEAT repeat-containing PBS lyase |
Protein accession | YP_430446 |
Protein GI | 83590437 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000000656599 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.759481 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAA AAAGGAGCCG AAAGCAATTC CCCATGGAAA ATGAAAAAAA ATTAAGGGAA CACCACCCCA TAAAACAAAC TCCAGTCAGG GAACCGGTCA GTAAACCCCT TTGCCCTTTC TGCGGCTTGC CCATTGCTAA ACCGCGGGAA CTGGCGACCC ACCGCCCGGG GGAGATGCCC GTCGGTTCCT GTTCCTGCGG TGCTGTTTAT GCCTGCGACG TCACCGGCCA TAACCTGGGG GCCGCCTTTA TTGAAGCCCT GGTTTTCGGT TGTAACCAGG ATTGGGACCT GGCCTGGGGA TTGTTACCGG AAGAAGATTA CCTGGAAAGA CTGGTGGAAC ATTACGATTA TGATAGCCAC CTCATTGTTC CGGAGGGAGT TTATGAGGGC CGCAGGGTGA CCGGGGCCCT CTATTTTGTA CGGTTGCAAG CAGATATCCG GGAAGTAACC GGGCCTGGAG TGCAGAGGAA ATTAAGGGAG GCCGCTACCC CTGGGGCAGA TAGGCCAGGC TACTGCCACC CTGCCGGTGC CAGGAAATAT AGTAAAAAAG AGATTACCCA ACTGGTAAAG GGATATCACC TTGAAACTCT GGTGGATATA GCCCGGCAGG ACAGGCGGGT CCTCGGCGAT CTCCAGCGCC TCCTCTGCTC GGGTGACGCC CTCCTACGCC TGCAGGCGGC AGACATTCTC GGCCGGGCTG CTGCCGTCTA TGCTACCGGC GCGCCGGAGA TCATTGCCAA TCTACTCCAG CGCCTCCTGG CCTCCGTCTC CGATCCCGGT GCCGCCAGCT GGGGCGCCGT TGATGCCATG GGCGAAATTA TCGCCAACGC TCCAGCTACC TTTGCCGGGT ACATCCCTCA GCTCTATCCC TTCCTGGAAG ATAAAATCCT CCGGCCCAGG GTACTGCGGG CCTTAGGCAG GATCGCCGGG GTTAAACCGG GGCTTTTACA GCGGGCAGCC CTGCATTTTT TAACCTTCCT CAGGGACCCC GACCCGGAAA CCAGGGGTTA TGCCGCCTGG CTCCTGGGCA TCCTGAGAAC GGAGGCCGCC CGGGAGGATC TAAAAGGGCT TCTCGATGAT CGGCAGGTCG TCGCCGTCTA TCATAACGGC GCCATTGATG AAAAGACGGT GGGGGAACTG GCCGCCGCAG CCCTGGACCG GCTGGCGGGA GCTTAA
|
Protein sequence | MNKKRSRKQF PMENEKKLRE HHPIKQTPVR EPVSKPLCPF CGLPIAKPRE LATHRPGEMP VGSCSCGAVY ACDVTGHNLG AAFIEALVFG CNQDWDLAWG LLPEEDYLER LVEHYDYDSH LIVPEGVYEG RRVTGALYFV RLQADIREVT GPGVQRKLRE AATPGADRPG YCHPAGARKY SKKEITQLVK GYHLETLVDI ARQDRRVLGD LQRLLCSGDA LLRLQAADIL GRAAAVYATG APEIIANLLQ RLLASVSDPG AASWGAVDAM GEIIANAPAT FAGYIPQLYP FLEDKILRPR VLRALGRIAG VKPGLLQRAA LHFLTFLRDP DPETRGYAAW LLGILRTEAA REDLKGLLDD RQVVAVYHNG AIDEKTVGEL AAAALDRLAG A
|
| |