Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1417 |
Symbol | |
ID | 7083499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1577717 |
End bp | 1580176 |
Gene Length | 2460 bp |
Protein Length | 819 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698434 |
Product | peptidase S16 lon domain protein |
Protein accession | YP_002355072 |
Protein GI | 217969838 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.185581 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGCTC CGCTCACGCC GCCGGCCCCC TTGCCCGTCG AGCGCCTGGC GACTCGCTGC GACCCGGCAG CGCTCGGCAT CGAAACCTCC GCCCAACTGC CCGAGCTCGA CGTCGCCCGC CTCCACGGCC GCGCGGTCGA CGCGATCCGG CTCGGCCTCG ACATCCGCGC CGAGGGCTAC AACCTCTTCG TGCTCGGCGA CCCCGGCAGC GGCCGCCACG AGCTCGTCCG CCGCCTGCTC GAGGACACGC GCGGCCGCGG CGATGCACCC GCGGACTGGT GCTACGCATG GAACTTCGCC AACGCCGCCC AGCCTCGGCT GCTGCGCCTG CCCTGCGGGC GCGGCCCGGC GCTGCGCGAC GATCTGGCCC GCTTCGTCGA GGAGCTGGTG CCGGCGATCG GCGCGGTGTT CGAGAGCGAG GAGCACCGCA ACCGCATCGA GGCCCTGCAG GAGGAGGCCA AGACGCGCGA GGAGAGCGCC TTGCGCAGCC TCGGCGACGA GGCCCAGAAG CTCGGCGTCG CGCTGCTGCG CACCCCGCAC GGCTTCGCCT TCCTGCCGAT GAAGGACGAG GGCAGCACGC TCACGCAGGA AGAATTCGAG CAGCTGCCCG AGGCACGCCA GCACGAGCTC GGCGAGCACA TCCGCGCGCT GCACGAGCGC CTGCACCGCC TGATGGGCGA TTTCCCGCGC TGGCGGCACG AGCTGCAGAA CCGCATCCGC GACGCCGGGC GCGAGGCGAT CCGCGCCACC GTCACCCACA TGGTCGACGA ATTGAAGGCG CGTCACGCAG ACCTGCCCGA GGTGTGCGCG CATCTGGACG CGGTGCTCGC CGACGTCGTC GCCAGCGGCG AGTCGCTGCG CGCGACGCCG CACGCCGACG AGGACAGCGA GACGCTGACC TACACCGGCA GCATCAGCGT GCAGCGCTAT CTGGTGAACC TGCTCGTCGC CAACCCCGCC GACGGCACCC GACCGATGGT GTACGAAGAC CACCCCACGC TGCAGAACCT GGTCGGCCGC ATCGACCACC TGGTGCACAT GGGCACCCTG GTGAGCAACT TCACCCTGAT CCGCGCCGGC GCCCTGCACC GCGCCAACGG CGGCTTCCTG GTGCTCGATG CGCTCAAGCT GCTGTCCCAG CCCTTCGCCT GGGAGGGGCT CAAGCGCGCG CTCAAATCCG CACGCCTGCG CATCGAGTCG CTCTCGGAGC TGATCGGGGT GACCGGCTCG GTGCAGCTCG AGCCCGAGCC GATGCCGCTC GAGCTCAAGG TGGTGCTGAT CGGCGATCGC CTGACCTACT ACCTGCTCGG CCGCTACGAT CCCGAGTTCG CCGCGCTGTT CCGCATCAAC GCCGACATGG AGAGCGAGAT CGAACGCTCC GCGGACAACA CCGCCGCTTA CGCCTGCCTG CTCGCCACCC TGGCGCGCCG CGCCGGCCTG CCGCCCTTGT CGGCGCCGGC GCTGGCGCGC CTGATCGAGC ACGCGGCGCG CCTCGCCGCC GACGCCGAGC GCCTGAGCGC GCGCACCCAG CCGCTCGACG ACCGCCTGCG CGAGGCGGCC CACTTCGCCC TCGCCGCCGG CGCCGCGCGC ATCGAGTGCG AGCATGTGGA CGCCGCGATC GCGGCCCACC GGCGCCGCCA CGAACGCATC CGCCTCGGCC ACCTCGACCA GATCCTGCGC GGGCAGTGGC TGATCGACAC CGCGGGCAGC CACGTCGGCC AGGTCAATGG CCTGGCGGTG GTGCCGCTCG GCGAAGACAG CTTCGCCCAC CCGCAGCGCA TCACCGCCAC GGTGCGCGCC GGCGCCGGCG AGGTCATCGA CATCGAGCGC GAGGTCAAGC TCGGCGGGCC GATCCACTCC AAGGGCGTGC TGATCCTGTC CGCCTTCCTC GCCGCACGCT TCGGCTGGAT GCTGCCGCTC TCGCTCAAGG CGAGCCTGGT GTTCGAGCAG TCCTACGGTG GCGTCGAGGG CGACAGCGCC TCGCTCGCCG AGCTGGTGGC CCTGCTCTCG GCGCTCTCGG GCGTGGCGGT GAAGCAGTCG CTCGCCGTGA CCGGTTCGGT GAACCAGTTC GGCGTCGTGC AGCCGGTAGG CGGGATCAAC GAGAAGATCG AGGGCTTCTT CGACCTGTGC GCGGTGCGCG GTCTCGACGG CCGCCAGGGC GTGCTGATTC CGCGCGCCAA CGTCTGCCAC CTGATGCTGC GCGACGACGT GGTCGAGGCG GTGCGGGCCG GGCGCTTCGC CGTGTGGGCG GTGGCCGACG CCGACGAGGC GGTGGAGCTG CTCACCGGCG TGGCGGCGGG CGTGCCGGAC GAGCAGGGCA GGATGGCTGC GGGCTCGATG AGCCGGCGCG TCGTCGAGGG CCTGCGCAAG CTGGCGAGGA TGCAGCGCGA GTTTGCGCGG CGGGGACACG AAGACGACGC CGACGCGGCC GGGCATCGAG GCCGGCCGCA CGCATCCTGA
|
Protein sequence | MSAPLTPPAP LPVERLATRC DPAALGIETS AQLPELDVAR LHGRAVDAIR LGLDIRAEGY NLFVLGDPGS GRHELVRRLL EDTRGRGDAP ADWCYAWNFA NAAQPRLLRL PCGRGPALRD DLARFVEELV PAIGAVFESE EHRNRIEALQ EEAKTREESA LRSLGDEAQK LGVALLRTPH GFAFLPMKDE GSTLTQEEFE QLPEARQHEL GEHIRALHER LHRLMGDFPR WRHELQNRIR DAGREAIRAT VTHMVDELKA RHADLPEVCA HLDAVLADVV ASGESLRATP HADEDSETLT YTGSISVQRY LVNLLVANPA DGTRPMVYED HPTLQNLVGR IDHLVHMGTL VSNFTLIRAG ALHRANGGFL VLDALKLLSQ PFAWEGLKRA LKSARLRIES LSELIGVTGS VQLEPEPMPL ELKVVLIGDR LTYYLLGRYD PEFAALFRIN ADMESEIERS ADNTAAYACL LATLARRAGL PPLSAPALAR LIEHAARLAA DAERLSARTQ PLDDRLREAA HFALAAGAAR IECEHVDAAI AAHRRRHERI RLGHLDQILR GQWLIDTAGS HVGQVNGLAV VPLGEDSFAH PQRITATVRA GAGEVIDIER EVKLGGPIHS KGVLILSAFL AARFGWMLPL SLKASLVFEQ SYGGVEGDSA SLAELVALLS ALSGVAVKQS LAVTGSVNQF GVVQPVGGIN EKIEGFFDLC AVRGLDGRQG VLIPRANVCH LMLRDDVVEA VRAGRFAVWA VADADEAVEL LTGVAAGVPD EQGRMAAGSM SRRVVEGLRK LARMQREFAR RGHEDDADAA GHRGRPHAS
|
| |