Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0344 |
Symbol | |
ID | 7085645 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 388733 |
End bp | 390247 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643697378 |
Product | protein of unknown function DUF88 |
Protein accession | YP_002354026 |
Protein GI | 217968792 |
COG category | [S] Function unknown |
COG ID | [COG1432] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.752457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAAAGCG CACTCTTCGT TGATTTCGAT AATGTCTATT CCGGACTGCG CAAGCTCGAT CCGGCAATGG CCGACCAGTT TGCGCAGAAG CCGCAGCGCT GGATGCAGTG GCTGGTGGCG TCCCTGGGTC TTCCCGAGCA CTCCCCCGAA AGTGCCCGGC GTCGCGTTCT GGTGCGGCGC TGCTACCTCA ATCCGCAGGT TTATCAGCGA TTTCGCCCCT CGTTCAACCT CGCGGGCTTC GAGATCATCG ACTGCCCGTC GCTCACGAGC GAGGGCAAGA CGAGCACCGA CATCCACATG GTGCTCGACA TCATCGATCT GCTGCAGCAC GAAGCGCATT ACGACGAATT CATCGTGTTC TCGGCGGACG CGGACTTCAC GCCCGTCCTG CGCAAGCTTC GTCGCTGGGA CCGGCGCACC ACCGTCCTTG CGATCGGATT CCCGTCGGCG GCCTACCGGG CTTCCGCCGA CCTGCTGATC GATCAGGATC TCTTCGTCCG CGATGCGCTT GGGTTCCGGG AAGAGGAATC CGGAGCGTCG GAGCCGGCGC CGGTGGCCGC ACCAGCCCTT GCGACGGCGG ACATCGTGCG CGCTGCGCGC GAGCTCATTC GGCGCAAAGT TGAGGAGTCC GTCGGGCCGG TTTCCCTTTC CCATCTCGCC TCGACCATTC TCAGGGACGT CGAAGGTGTT GACGCGACCA GTTGGGCGGG CTTCGGGAGC TTTCGGCAGC TTGTCGATGA GGCCCGTTTC GCGCCGCTGG TGGTGAGCTG GGAAGGCGGT GGGGTCATCT TCGATCCGGC CCGTCACGCG AAACCCGAGC CCGCAGCCAG GAAGGTGAAG GTCGAGGACG AAGTGAACGA CGTGACCCGG CTCATCCGGA CGGAGGTGGA GGCGTCAGCG CAACCGGTGC CATGCGCACG CATCGCCCAG CTCATCACGT CCAGGCATGG CGCAATCGCC AAGGACTGGA ACGGCCTGGG ATCCTTCCGC AAGATGGTGG AAGGCCTCAA CCTTGCGCCG ATCAAGGTGG ATTGGGCGGG TGCCGGAGGG CGCATGTACG ATCCTGCACG ACATCGCCTG ACCGCGCCCA ATGGGGCAAA GACGAACGGC AGCGCCATGC CGGACTGGGG CAAGGATGCC GATCTCCTGC CGATCGCATC CCAGATCCAT GATGTGACCA ACGCTCCGCT GTGGTCTCCG GCCGACTACC AGAACCTGTT CCGAATGCTC GCCGAGGACT TGGCGGCGCA GCCGTTCGAC CTCGCCGAGA CGGGCAAGCG CGTGCGCGAC AGGATCCGGG CGCAAGGGCG ACCCATCAAT CGCCAGAACG TGAACTGGAT CCTGCAGGGT TTGCTGTTCC GCGGGCACGT GTTCGGGAGG GGCGAGGATG ACTCCACGAC GCTGGCGCGC AAGTGCGCCG ACAACATCAA GTCCCTGTGC CTGCGCGAGC AGATGGTGAT CGACTCGGTG GTGGATGCGG CGATCATGCG GTGGATTGTC GGCGGCAGCG ACTGA
|
Protein sequence | MKSALFVDFD NVYSGLRKLD PAMADQFAQK PQRWMQWLVA SLGLPEHSPE SARRRVLVRR CYLNPQVYQR FRPSFNLAGF EIIDCPSLTS EGKTSTDIHM VLDIIDLLQH EAHYDEFIVF SADADFTPVL RKLRRWDRRT TVLAIGFPSA AYRASADLLI DQDLFVRDAL GFREEESGAS EPAPVAAPAL ATADIVRAAR ELIRRKVEES VGPVSLSHLA STILRDVEGV DATSWAGFGS FRQLVDEARF APLVVSWEGG GVIFDPARHA KPEPAARKVK VEDEVNDVTR LIRTEVEASA QPVPCARIAQ LITSRHGAIA KDWNGLGSFR KMVEGLNLAP IKVDWAGAGG RMYDPARHRL TAPNGAKTNG SAMPDWGKDA DLLPIASQIH DVTNAPLWSP ADYQNLFRML AEDLAAQPFD LAETGKRVRD RIRAQGRPIN RQNVNWILQG LLFRGHVFGR GEDDSTTLAR KCADNIKSLC LREQMVIDSV VDAAIMRWIV GGSD
|
| |