Gene Information |
Locus tag | Tmz1t_1700 |
Symbol | |
ID | 7084120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1907624 |
End bp | 1909318 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643698721 |
Product | protein of unknown function DUF88 |
Protein accession | YP_002355351 |
Protein GI | 217970117 |
COG category | [S] Function unknown |
COG ID | [COG1432] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00288] conserved hypothetical protein TIGR00288 |
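The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates (the record does not state its convention, but the listed Gene Length agrees with it) and a non-spliced CDS that ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of an inclusive, 1-based interval (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(1907624, 1909318)
print(length)                  # 1695
print(protein_length(length))  # 564
```

Both values match the Gene Length (1695 bp) and Protein Length (564 aa) fields of this record.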
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0123455 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTCCT CCCCCGACAG CATCAGCATG GCGCTGTTCT GCGACTTCGA GAACGTGGCG CTGGGCGTGC GGGACGCGAA GTACGAGAAG TTCGACATCA AGCGGGTGCT CGAGCGCCTG CTGCTCAAGG GCAGCATCGT GGTCAAGAAG GCCTATTGCG ACTGGGACCG CTACAAGAGC TTCAAGGCGG CGATGCACGA GGCCAACTTC GAGCTCATCG AGATCCCCCA CGTGCGCCAG TCCGGCAAGA ATTCGGCCGA CATCCGGCTC GTGGTCGACG CGCTCGACCT GTGCTACACC AAGTCGCATG TCGACACCTT CGTGATCATC AGCGGCGACT CGGACTTCTC GCCGCTGGTG TCCAAGCTGC GCGAGAACGC CAAGCAGGTG ATTGGCGTCG GGGTCAAGCA GTCCACCTCC GACCTGCTGA TCGCCAACTG CGACGAGTTC ATCTTCTACG ACGACCTCGT GCGCGACAGC CAGCGCGCCG CAGCCAAGCG CGAGGCGCGC GACAACCCGC CCGCCAGCCG CAAGCAGCCC GAGGAGGACA AGGCCCGCCG CGAGGAGCTC GAGTCGCGCC GCAGCAAGGC GATCGACATT GCCGTCGAGA CCTTCGACGC GCTGCTCGCC GAACGCGGCG AGAGCGGCAA GATCTGGTCC TCGATGCTCA AGGAGGCGAT CAAGCGCCGC AAGCCCGATT TCAACGAGAG CTACTTCGGC TTCCGCGCCT TCGGCAACCT GCTCGAGGAA GCGCAGGCGC GCGGCCTGCT GGAACTCGGC CGCGACGAGA AGTCGGGCAT CTTCGTCACC CGCCCCGGCC CGGGGCTGGC CGCACCCCGC ATGCGCGAGG AGTTCACCAT GGCGGGCGAG TCCCTGGTGG TGGAGCGTGC CGCCCCGCGC ACCGAGCCGA CCGTGGGCGT CGCCGAGGCG GAGCCCGGCG CGGAGTACGC CAGCTCGCCT GCCGGCGCGG GCGGGGAGAC TTCCGCCAAC GCTTCGGAGC CGCGGCGCGG TCGCGGTCGC GGTCGTGGCC GCGCGGCACG AGCGCTGGAG GAGCCCGCGG CTGAAGGGGC GGCGGTCGAG GCGCATGTCG CCGCCACGGT GGAGGAGGGC GAGACCGCCA TCGAAAGCGC GCCGGCAGCG CCTTTCTTGC CGGCGGAGAC GGTGGTAGAG GCGACCTCCG AGGGCGCCGG GCACGCCGAG GACGCCGCGG TGGCGGTCGA GCCTCGCGCA GCCGAGGCCG AGCCCGAGTC CGCGCAGCCG AAGCGCTCGT CCGCGACGCG CAGGCGGCGT GGCGGCAAGG CCGAGGCGCC CGTCCTGCCG GCCGCCGCTG CGGCCACCGT CGCCGGCGCG GACCATCCGC CTGCTCCTGC CGCGGCGAAC GAAGCCCACG ACGCCGTGGC CGGCGCGGTG CAGGAGGAAA AGCCGAAGGG CAAAGGCAAG GGGAAGGACA AGCAGAAGCC GAAGGAGAAA CCCTCCGAGG CCGCAAAGCC GTCTTCGCGG CGTGGTCGCG GTGGGCGCGC TGCGGGTGCC AAGGCCGCTG CGGAAGGGGC GGGCGGGACG GCGACCGCGG CGGGCCCGTC GGCGAGCGCC GCGCCCCTGA CGAGCGCCGC GCCCGCCGCG AGCACGCCCG CGGCGGAGAA CCCCGCCCCG CGTCCGCGCC GCGCGCGCAA GCCGAAGGCC GCGTCGGCCG AATGA
|
Protein sequence | MASSPDSISM ALFCDFENVA LGVRDAKYEK FDIKRVLERL LLKGSIVVKK AYCDWDRYKS FKAAMHEANF ELIEIPHVRQ SGKNSADIRL VVDALDLCYT KSHVDTFVII SGDSDFSPLV SKLRENAKQV IGVGVKQSTS DLLIANCDEF IFYDDLVRDS QRAAAKREAR DNPPASRKQP EEDKARREEL ESRRSKAIDI AVETFDALLA ERGESGKIWS SMLKEAIKRR KPDFNESYFG FRAFGNLLEE AQARGLLELG RDEKSGIFVT RPGPGLAAPR MREEFTMAGE SLVVERAAPR TEPTVGVAEA EPGAEYASSP AGAGGETSAN ASEPRRGRGR GRGRAARALE EPAAEGAAVE AHVAATVEEG ETAIESAPAA PFLPAETVVE ATSEGAGHAE DAAVAVEPRA AEAEPESAQP KRSSATRRRR GGKAEAPVLP AAAAATVAGA DHPPAPAAAN EAHDAVAGAV QEEKPKGKGK GKDKQKPKEK PSEAAKPSSR RGRGGRAAGA KAAAEGAGGT ATAAGPSASA APLTSAAPAA STPAAENPAP RPRRARKPKA ASAE
|
| |
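The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the space-separated 10-base blocks shown in this record are joined before translation, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as start codons). Since this gene starts with ATG, no special handling of alternative start codons is included. The function name is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace (such as the 10-base block separators used in this
    record) is removed before translation.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGGCTTCCT CCCCCGACAG CATCAGCATG"))  # MASSPDSISM
```

The output matches the first block of the protein sequence above. Passing the complete gene sequence to the same function should yield the full protein, with the terminal TGA codon ending translation.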