Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2656 |
Symbol | |
ID | 7873397 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 2881532 |
End bp | 2882476 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643699579 |
Product | hypothetical protein |
Protein accession | YP_002889635 |
Protein GI | 237653321 |
COG category | [S] Function unknown |
COG ID | [COG1806] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000624696 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGACA TCGCCCGCCG CACCGTCTTC TTCGTCTCCG ACGGCACCGG CATTACCGCC GAAACCCTTG GCCACAGCCT GTTGGCGCAG TTTCCCGAGG CGCGCTTCCG CCAGGTGCGC GCCCCTTTCA TCGACGACAT CGACAAGGCG ATCGACTGCG CGACGCAGAT CCGCGAGGCT GCGGCCGAGG ACGGCGTGCG CCCGATCGTA TTCAGCACGC TGGTCAATCA GGCCACCGTT GACGCACTGC ACAAATCCGA CGCGCTCTTC CTGGACCTCT TCGAGCGCTT CATCGGCCCG CTCGAAGCCG AGCTCGGCCA GCGCTCCACC CATGCCGTGG GCCGCTTCCA TGGCATCGCC GACAGCCTCA ACTACAAGTA CCGCATCGAG GCGATCAACT TCGCCATGGC ACACGACGAC GGCATCTCGA GCGAGGGCGA GCTCGCCGAG GCCGACGTGA TCCTGGTCGG CGTGTCGCGC TCGGGCAAGA CCCCCACCAG CCTCTACCTG GCGATGCAGT TCGGCGTCAA GGCGGCCAAC TACCCGCTGA TCCCCGAGGA CTTCGAGCGC AACAAGCTTC CCGGCGAGCT GCACAAGTAC CGCACCAAGC TCTTCGGCCT CACCATCGCC CCCGAGCGCC TGTCGCAGAT CCGCCAGGAA CGCCGCCCCA ACAGCCGCTA CGCCGCGCTG GAGAACTGCC GCTACGAGAT CGACGCCGCG CACAAGCTGA TGCGCCGCGA GAACATCCGC TGCCTCGACT CGACCACCAA GTCGATCGAG GAGATCTCCG CCACCATCCT GCAGACCATC CGGGGTCGAC CGCCCCGGTT TCTGAGCGCG CGCCGGGCGC TTCGACCGGC GTTTTCCCGC GGCATTGGTA GAATACGCGG CCATGCGTGC CGCCCAAGCC GGCACCCCGA ACAGCGCAAC AGGAACCCGA CATGA
|
Protein sequence | MNDIARRTVF FVSDGTGITA ETLGHSLLAQ FPEARFRQVR APFIDDIDKA IDCATQIREA AAEDGVRPIV FSTLVNQATV DALHKSDALF LDLFERFIGP LEAELGQRST HAVGRFHGIA DSLNYKYRIE AINFAMAHDD GISSEGELAE ADVILVGVSR SGKTPTSLYL AMQFGVKAAN YPLIPEDFER NKLPGELHKY RTKLFGLTIA PERLSQIRQE RRPNSRYAAL ENCRYEIDAA HKLMRRENIR CLDSTTKSIE EISATILQTI RGRPPRFLSA RRALRPAFSR GIGRIRGHAC RPSRHPEQRN RNPT
|
| |