Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1487 |
Symbol | |
ID | 7083569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1659268 |
End bp | 1660374 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698504 |
Product | hypothetical protein |
Protein accession | YP_002355141 |
Protein GI | 217969907 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCGAA GACGATTCAT CGCCGCCGCG GCGCTGGCCG GGCTGGGGGT GGCGGGCGGG ACCGGCGTCC TTTCCCATCG GCAGAGCGTG GCGGTCTACG AGGCAGCAGC GGGTGCGACC TGGCGCCATG GCGAGCCGGC GCAGCGGGAC ACGCGTGCGG CACTGCGCGA GCTCGTGCGC TACGCCACCC TCGCGCCGTC CAGCCACAAC ACGCAGTGTT GGAAGTTCGC GCTGGCGGAG CGGTCGGTTT CCATTCTCCC CGACTACTCG CGGCGCTGCC CGGTGGTGGA TCCCGACGAT CACCACCTCT TCGTCAGCCT GGGCTGCGCT GCCGAGAACC TCGTGCAGGC GGCGCCGGCG ATGGGTTTCC ACTGCGAGGC CGCCTTCGAC GGGGAGGCCG CCGAGGTCCT GAACTGCGCG CTGGCGCCCG CGCCGAGCCG ACGAACCGCC CTGTTCGAGG CCATTCCGCA GCGTCAATCG ACCCGCGGCG AGTTCGACTC GCGGCCGCTT TCGAACGAGG AGCTGCGGCG GCTCGAGGCG GCCGGCTCGG GCAGGGGCGT GAGCGTGCGC CTGCTCACCG GGGCGCAGGC GTTGGAACGC GTCCTGGAGT TCGTCGTCGC GGGGAACACG GCGCAGATGC GGGACCCGGC GTTCGTGCGC GAGCTGAAGG CGTGGATCCG CTTCAACGGC AGCGATGCGG CGCGCACCGG CGACGGCCTG TTCGCAGGGG CGTCCGGAAA CCCCTCCGCC CCGGCCTGGC TTGGCGGCCT GCTGTTCGAT GCCTTCTTCA CCGCGGCGGC CGAGAACGAC AAGTACGCGC GGCAGGTCCG CAGCGCCGCG GGGATCGCGG TGTTCGTCTC GGAGCGCGAC GATCGCGCGC ACTGGATCGA GGCGGGGCGT TGCTTCCAGC GCTTCGCGCT GCAGGCGACC GCGATCGGCG TCCGCACCGC GCATCTCAAC CAGCCGGTCG AGCTCGCCAC GCTGCGCCCC GCCTTTGCCG CCGACCTGGG CATCACGAAC GGCCGGCCGG ATCTGGTGAT CCGTTTCGGC AGGGGGCCGG CGATGCCCCG ATCCCTGCGC CGTCCGCTCG ACGCCGTGCT GCTGTGA
|
Protein sequence | MERRRFIAAA ALAGLGVAGG TGVLSHRQSV AVYEAAAGAT WRHGEPAQRD TRAALRELVR YATLAPSSHN TQCWKFALAE RSVSILPDYS RRCPVVDPDD HHLFVSLGCA AENLVQAAPA MGFHCEAAFD GEAAEVLNCA LAPAPSRRTA LFEAIPQRQS TRGEFDSRPL SNEELRRLEA AGSGRGVSVR LLTGAQALER VLEFVVAGNT AQMRDPAFVR ELKAWIRFNG SDAARTGDGL FAGASGNPSA PAWLGGLLFD AFFTAAAEND KYARQVRSAA GIAVFVSERD DRAHWIEAGR CFQRFALQAT AIGVRTAHLN QPVELATLRP AFAADLGITN GRPDLVIRFG RGPAMPRSLR RPLDAVLL
|
| |