Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1455 |
Symbol | |
ID | 7083538 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1623221 |
End bp | 1624537 |
Gene Length | 1317 bp |
Protein Length | 438 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698473 |
Product | SurA domain protein |
Protein accession | YP_002355110 |
Protein GI | 217969876 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0760] Parvulin-like peptidyl-prolyl isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.882987 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAC TCCTTTTCCG CTCCCGCGTC GCAATCGCCA TCGGCCTCGC GGCCACCACG CTCGCGCTGC CGGTGCATTC CGCGCCGCGC GCGGTCGAGG TCGACCGCAT CGTCGCGGTC GTCAACAACG AGGTCATCAC CGGCCTCGAG CTGCGCGCGC GCATCGAGCA GACCCGCCGC CAGCTCGCCC GCCAGGGCGC CCAGCTTCCC CCCGAGGAAG TCCTGCAGCG CCAGCTGCTC GAGCGCCTGA TCGTCGAGCG TGCCCAGCTC CAGCTCGCGC GCGAGTCCTC GCTGCGCGTG GACGACGTCA CGCTCGACCG CGCGATCGAG CGCATCGCCT CGAACAACAA GCTCTCGATC GACCAGTTGC GCGCCACGCT GGAGAAGGAT GGCGTCACGT GGAGCCGCTT CCGCGACGAG ATCCGCAGCG AGATCCTGCT CACCCGCCTG CGCGAACGCG AGGTCGACAG CCGCATCGTC GTCACCGACG CCGAGATCGA CAACTTCATC GCCAACAACC CGGACGCCTT CTCCGGCCAG GAGTTCGCCG TCGCCCACAT CCTGCTGCGC ACGCCCGAGG GCGCCTCGCC GCAGCAGGTC GAGGCGGTGG CCAGGCGCGC CGAGCAGGTG ATGGCTCGGC TGCGCTCGGG CGAGGACTTC GCCCGCGTCG CCGCCGAGGT CTCCGACGCG CCCGACGGCC TGCAGGGCGG CAGCCTGGGC TGGCGCCCGC TCGATCGCCT GCCGGCCCTG TTCGCCGACG CCGTGCGCCG CATGCGCCCG GGCGAGACTT CGCCGGTGCT GCGCAGCGCC GCCGGCCTGC ACATCGTGCG CCTGGTCGAC GCGCGCGGCG GCGGTGCGGC GGCGGTGCAG AAGCTCGAGC AGACGCGCGC GCGCCACATC CTGATCAAGA CCTCCGAGGT CCTCTCCGAC GCCGACGCGG AGGCCCGCCT GCTGGCGATC CGCGAACGCG TGGTCAATGG CGCCGACTTC GCCGAGCTGG CCAAGGCGAG TTCGGCCGAC CTCTCCGCCG CCCGCGGCGG CGATCTCGGC TGGCTCAACC CGGGCGACAC CGTGCCCGAG TTCGAGCGCG CGATGAACGC GCTGCGGCCG GGCGAGGTCA GCGCGCCGGT GCGCTCGCCT TTCGGCTGGC ATCTGATCCA GCTCGTGGAG CGCCGCATGC AGGACGTCAC CGACGAGCGC AAGCGCGCCG CCGCGCGTCA GGCCCTGCGC GAACGCAAGG CCGAGCAGGC CTACGAGGAC TGGTTGCGCC AGCTGCGCGA CAGCACCTAC GTCGACTACC GCATCGAGCG CGAATGA
|
Protein sequence | MKRLLFRSRV AIAIGLAATT LALPVHSAPR AVEVDRIVAV VNNEVITGLE LRARIEQTRR QLARQGAQLP PEEVLQRQLL ERLIVERAQL QLARESSLRV DDVTLDRAIE RIASNNKLSI DQLRATLEKD GVTWSRFRDE IRSEILLTRL REREVDSRIV VTDAEIDNFI ANNPDAFSGQ EFAVAHILLR TPEGASPQQV EAVARRAEQV MARLRSGEDF ARVAAEVSDA PDGLQGGSLG WRPLDRLPAL FADAVRRMRP GETSPVLRSA AGLHIVRLVD ARGGGAAAVQ KLEQTRARHI LIKTSEVLSD ADAEARLLAI RERVVNGADF AELAKASSAD LSAARGGDLG WLNPGDTVPE FERAMNALRP GEVSAPVRSP FGWHLIQLVE RRMQDVTDER KRAAARQALR ERKAEQAYED WLRQLRDSTY VDYRIERE
|
| |