Gene Information |
Locus tag | Tmz1t_0301 |
Symbol | |
ID | 7085602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 345522 |
End bp | 347552 |
Gene Length | 2031 bp |
Protein Length | 676 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643697341 |
Product | putative ATP-dependent Lon protease |
Protein accession | YP_002353989 |
Protein GI | 217968755 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4930] Predicted ATP-dependent Lon-type protease |
TIGRFAM ID | [TIGR02653] conserved hypothetical protein; [TIGR02688] conserved hypothetical protein TIGR02688 |
|
|
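The coordinate and length fields in the Gene Information table above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the reported gene length of an unspliced CDS includes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS.

    Assumes the gene length includes the terminal stop codon,
    which is not translated into a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Tmz1t_0301.
length = gene_length_bp(345522, 347552)
residues = protein_length_aa(length)
print(length, residues)  # 2031 676
```

Under these assumptions the computed values agree with the Gene Length (2031 bp) and Protein Length (676 aa) fields of the record.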
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAC CCGTAGATAC CGCCCGACTT GATGGGCTGC TCAACACGCA CTTCGCCGGC AAGGTGGTCC GCAAGGACCT GACCAAGCTG ATCAAGGAGG GGGCAAACGT TCCCGTTTAT GTGCTTGAGT ACCTGCTCGG GATGTACTGC GCGTCGGACG ACGAGGCGAC GATCCAGAAC GGGATGACGA TGGTCAAGCG CATCCTGGCC GAGAACTACG TGCGCCCGGA CGAGGCCGAG AAGATCAAGT CGAAGATCCG CGAGAGCGGC AGCTACAAGG TCATCGACAA GGTGACCGTG AAGCTCAACG AGAAGCGCGA TGTCTACGCG GCGCTGCTCT CGAACCTTGG GGTGAAGAAC GCTGAGGTGT CGGACGCCTT CGTTCGCCAG TTCGAAAAAC TGCTCGTGGG CGGCATCTGG TGCATCGTCA CCCTGAACTA CCTGTTCGAG GAGAACCAGC GCGGCTCACC CTTCACGGTG ACCGATCTCA AGCCGATCCA GATGCCCAAC ATGGACATGG CGGGCCTGTT CGAGGGGCGG CGGGCCTTCA CCGAGGACGA ATGGATCGAC GCGCTGATCC GCTCGACCGG CATGGAGCCG AGCTGCTTCA AGGAGCGGGT GAAGTGGCAT CTGCTCGCGC GGATGATTCC GCTGGTCGAG AACAACTACA ACTTCTGCGA GCTGGGTCCG CGCGGCACGG GCAAGAGCCA CATCTACAAG GAAATCAGCC CTAACAGCAT TCTGGTGTCA GGTGGGCAGA CGACGGTGGC CAACCTCTTC TACAACATGA GCGCGCGCAA GGTGGGCCTG GTGGGCCTTT GGGATACGGT CGCCTTCGAT GAGGTGGCTG GCATCAACTT CAAGGACCAC GACGGCGTGC AGATCATGAA GGACTACATG GCGTCCGGCT CCTTCAGCCG GGGCAGGGAG GCGATCAACG CCAATGCCTC GATGGTCTTC GTGGGCAACA TCAACCAGAC CGTCGAATCC CTGGTGAAGA CCAGCCATCT GCTCGCCCCG TTTCCCGAGG CCATGATCGA CTCGGCCTTC TTCGACCGGT TCCATGCCTA TGTGCCGGGC TGGGAGATCC CGAAGATGCG CCCGGAGTTC TTCACCAACC AGTACGGCCT GATCGTCGAC TATCTGGCCG AGTACCTGCG CGAGATGCGC AAGCGCAACT TCGCCGACGC GATCGACAAG TGGTTCAAGC TCGGCAACAA CCTCAACCAG CGCGACACCA TCGCCGTGCG CCGAACCGTC TCCGGCCTGC TCAAGCTGAT CTGCCCGCAC GGCGAGTACG ACAAGGAGAT CGTGCGCCGC TGCCTGGAGT ACGCGCTCGA GAGCCGTCGC CGGGTGAAGG AGCAGCTCAA GAAAATCGGT GGCATGGAGT TCTACGACGT CCACTTCAGC TACATCGACC TGGAGGCGGG CGAGGAGCGC TTCGTTACGG TGCCCGAACA ATCCGGCGGG GCGCTGATCG CGGAAGGCCG GCTCAACCCG GGTACCCTGC ACACCATCAG CCTGGGCGAC ACCGAGATGC CGGGCGCTTA TCGCATCGAG ATCCAGACCA TCGCCGGCGG CGGCAAGGTC TCGGCCTCGG GCCTTGCGCC GCGCGAGGCG GTCAAGGTCG CCTTCGATTA CTTCAAGGCC AACTCGGGCC GGGTGAGCGC ATCGATCAAG CCGGGCGAGC ACGATTTTCA CCTGCACTTG GTGGAGCTGC AGCACACCGG CACGCCGAAG GCGCTGACCC TGGCCGGGTT CATCGCCCTG TGCTCGGGTG CGCTCGCCAA GCCGGTTCAG 
AGCCAGATGG TGGTGATGGG CGACATGAGC CTCGGCGGAA CGGTCGTCCA GGTGCGCAAC CTGGCCGAGA GCCTGCAGGT GGCCTTCGAT GCCGGCGCAA AGCGGATCCT GCTGCCGATG TCGAGCGTGA CGGATATTCC GTCGGTGCCT GGGGAACTGT TCGCGAAGTT CCAGACGAGC TTTTACTCGG ATCCGGTGGA TGCGGTGTTC AAGGCGTTGG GGGTGGAGTG A
|
Protein sequence | MTEPVDTARL DGLLNTHFAG KVVRKDLTKL IKEGANVPVY VLEYLLGMYC ASDDEATIQN GMTMVKRILA ENYVRPDEAE KIKSKIRESG SYKVIDKVTV KLNEKRDVYA ALLSNLGVKN AEVSDAFVRQ FEKLLVGGIW CIVTLNYLFE ENQRGSPFTV TDLKPIQMPN MDMAGLFEGR RAFTEDEWID ALIRSTGMEP SCFKERVKWH LLARMIPLVE NNYNFCELGP RGTGKSHIYK EISPNSILVS GGQTTVANLF YNMSARKVGL VGLWDTVAFD EVAGINFKDH DGVQIMKDYM ASGSFSRGRE AINANASMVF VGNINQTVES LVKTSHLLAP FPEAMIDSAF FDRFHAYVPG WEIPKMRPEF FTNQYGLIVD YLAEYLREMR KRNFADAIDK WFKLGNNLNQ RDTIAVRRTV SGLLKLICPH GEYDKEIVRR CLEYALESRR RVKEQLKKIG GMEFYDVHFS YIDLEAGEER FVTVPEQSGG ALIAEGRLNP GTLHTISLGD TEMPGAYRIE IQTIAGGGKV SASGLAPREA VKVAFDYFKA NSGRVSASIK PGEHDFHLHL VELQHTGTPK ALTLAGFIAL CSGALAKPVQ SQMVVMGDMS LGGTVVQVRN LAESLQVAFD AGAKRILLPM SSVTDIPSVP GELFAKFQTS FYSDPVDAVF KALGVE
|
| |