Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_4062 |
Symbol | |
ID | 7873289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 4461266 |
End bp | 4463107 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700993 |
Product | signal peptide peptidase SppA, 67K type |
Protein accession | YP_002891016 |
Protein GI | 237654702 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0616] Periplasmic serine proteases (ClpP class) |
TIGRFAM ID | [TIGR00705] signal peptide peptidase SppA, 67K type [TIGR00706] signal peptide peptidase SppA, 36K type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.244218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCGGG GCCTGTTCCG CTTCCTATTG AACGTGCTGC GCATGTTCGT GCGCGCGCTC GACCTCGTCG TGCGCGGCGT CTTCTACGCC CTGCTGATCT TCGGCCTCGG TGTGCTGGTG TCCTTCTTCT TCCACCCCGA GCCCGAGGTG CAAGCCGGGT CCGCGCTGGT GCTGCGACCG GTGGGCACGA TCGTCGAGCA GGCCGAGCTC GAGCCACCGC TGGCGCTGCT GCGCGCCGGG GGCGCACCGG CCGGCCAGCT GCGTCTGGCG GATCTGGTGG ACGCGGTGCG CAAGGCGCGC GACGACGCGC GCATCGCCGC GCTGGTGATC GAGACCGACG AGCTGGTCGG CGGCGGCTTC TCCAAGCTTG CCGAGCTGCG CGCCGCGATC GCCGATTTCA AGGCCTCGGG CAAGCCGGTG CTGGCGCGCG GCGAGCGCTT CACCCAGTCG CAGTACTACC TCGCCTCGGT GGCCGACGAA CTCCACCTGT CGCCGGACGG CTTCGTGCTG CTGCGCGGCC TGGCGCGCTA CGGCACCTAC TTCCGCGACG CGCTCGACAA GCTCGGGGTC AAGGTGCACG TGTTCCGCGT CGGCGAGTAC AAGTCCTTCT CCGAGCCCTT CACCCGCAGC GACATGTCCG ACGAGGACCG CGAGGCCACC CGCGACCTCC TCGACGGGCT GTGGCGCTTC ATGCGCGACG ACATCGCCGC CAGCCGCAAG CTCGCGCCCG CGGCGATCGA CGCGCATGTG AACGACATCC GCGGCGCGCT CGCTGCAGCC GGGGGCGATG CCGCCAAGGC CGCGCTCGCC GCCGGCCTGG TGGACCGCTT CAGCACCCGC GATGAATGGC GCGCCCGCCT GATCGAGGCC GTCGGCACCG ACCACGAGGG CAAGGACGTG CGCACCATCG AGGCCGAGGC CTACCTCGCG CTCGCTGCGG ACGACACCCG TCACGCCGCC GGCAGCGTGG CGGTGATCGT CGCCCAGGGC ACCATCGTCG ATGGCGCCGA GCCGGCCGGC GTGGTCGCCG GCGACACCTT CGCCCGCCTG ATCCGCGAGG CGCGCGAGGA CGAGGACATC AAGGCGCTGG TGCTGCGCAT CGACAGCCCG GGCGGCAGCG CCTGGGCCTC GGAGCTCATC CGTCGCGAAC TCGAACTCAC CCGCCAGGCC GGCAAGCCGG TGATCGCGTC GATGAGCTCG GTGGCGGCCT CGGGCGGCTA CTGGATCGCC ACCGGTGCCG ACGAGATCTG GGCGGCGCCT TCCACGGTGA CCGGCTCGAT CGGCATCTTC GGCCTCTTCC CGGAGTTCTC CGAGCCGCTG CGCCGCCTCG GCATCGGCGT CGATGGCGTC GCCACCGCGC CGCTCGCCGG CGCGCTCGAC CCGCGTCGCC CGCTCGACCC GGCCGCGGCC GAGGCCATGC AGCTCGGCAT CGAGCACGGC TACCGGCGCT TCCTGGAGGT CGTCGCGCAG GCGCGCAAGC TGACGGTCGC GGAGGTCGAC GCGGTCGCTC GCGGTCGCGT GTGGACCGGA GAGGCGGCGA GCGGCCTCGG CCTGGTCGAC AAGCTCGGCA GCCTGGACGA CGCGATCGCC GCCGCGGCCG CGCGCGCCGG CCTCGCCGAG CATCAGGTGG TGTGGCCGGC GGCGGGCGAG TCCCTCGAGC AGCGCGTGTT GCGCCGGCTG CTGCGCACTG GCGAGGAACT CGGCATCGAC CTGGCCGGCC GCAGCGCGCC GGCCGCGCCG CTCGCCGCCG CGGCCGCGGA CGTGGAGCGC GCCGCCCGCG CGCTGCTGCG CTGGAACGAT CCGCGCCACC ACTACCTGCA CTGCCTGTGC GACGCGCCTT GA
|
Protein sequence | MIRGLFRFLL NVLRMFVRAL DLVVRGVFYA LLIFGLGVLV SFFFHPEPEV QAGSALVLRP VGTIVEQAEL EPPLALLRAG GAPAGQLRLA DLVDAVRKAR DDARIAALVI ETDELVGGGF SKLAELRAAI ADFKASGKPV LARGERFTQS QYYLASVADE LHLSPDGFVL LRGLARYGTY FRDALDKLGV KVHVFRVGEY KSFSEPFTRS DMSDEDREAT RDLLDGLWRF MRDDIAASRK LAPAAIDAHV NDIRGALAAA GGDAAKAALA AGLVDRFSTR DEWRARLIEA VGTDHEGKDV RTIEAEAYLA LAADDTRHAA GSVAVIVAQG TIVDGAEPAG VVAGDTFARL IREAREDEDI KALVLRIDSP GGSAWASELI RRELELTRQA GKPVIASMSS VAASGGYWIA TGADEIWAAP STVTGSIGIF GLFPEFSEPL RRLGIGVDGV ATAPLAGALD PRRPLDPAAA EAMQLGIEHG YRRFLEVVAQ ARKLTVAEVD AVARGRVWTG EAASGLGLVD KLGSLDDAIA AAAARAGLAE HQVVWPAAGE SLEQRVLRRL LRTGEELGID LAGRSAPAAP LAAAAADVER AARALLRWND PRHHYLHCLC DAP
|
| |