Gene Tmz1t_1700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1700 
Symbol 
ID7084120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1907624 
End bp1909318 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content72% 
IMG OID643698721 
Productprotein of unknown function DUF88 
Protein accessionYP_002355351 
Protein GI217970117 
COG category[S] Function unknown 
COG ID[COG1432] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00288] conserved hypothetical protein TIGR00288 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0123455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTCCT CCCCCGACAG CATCAGCATG GCGCTGTTCT GCGACTTCGA GAACGTGGCG 
CTGGGCGTGC GGGACGCGAA GTACGAGAAG TTCGACATCA AGCGGGTGCT CGAGCGCCTG
CTGCTCAAGG GCAGCATCGT GGTCAAGAAG GCCTATTGCG ACTGGGACCG CTACAAGAGC
TTCAAGGCGG CGATGCACGA GGCCAACTTC GAGCTCATCG AGATCCCCCA CGTGCGCCAG
TCCGGCAAGA ATTCGGCCGA CATCCGGCTC GTGGTCGACG CGCTCGACCT GTGCTACACC
AAGTCGCATG TCGACACCTT CGTGATCATC AGCGGCGACT CGGACTTCTC GCCGCTGGTG
TCCAAGCTGC GCGAGAACGC CAAGCAGGTG ATTGGCGTCG GGGTCAAGCA GTCCACCTCC
GACCTGCTGA TCGCCAACTG CGACGAGTTC ATCTTCTACG ACGACCTCGT GCGCGACAGC
CAGCGCGCCG CAGCCAAGCG CGAGGCGCGC GACAACCCGC CCGCCAGCCG CAAGCAGCCC
GAGGAGGACA AGGCCCGCCG CGAGGAGCTC GAGTCGCGCC GCAGCAAGGC GATCGACATT
GCCGTCGAGA CCTTCGACGC GCTGCTCGCC GAACGCGGCG AGAGCGGCAA GATCTGGTCC
TCGATGCTCA AGGAGGCGAT CAAGCGCCGC AAGCCCGATT TCAACGAGAG CTACTTCGGC
TTCCGCGCCT TCGGCAACCT GCTCGAGGAA GCGCAGGCGC GCGGCCTGCT GGAACTCGGC
CGCGACGAGA AGTCGGGCAT CTTCGTCACC CGCCCCGGCC CGGGGCTGGC CGCACCCCGC
ATGCGCGAGG AGTTCACCAT GGCGGGCGAG TCCCTGGTGG TGGAGCGTGC CGCCCCGCGC
ACCGAGCCGA CCGTGGGCGT CGCCGAGGCG GAGCCCGGCG CGGAGTACGC CAGCTCGCCT
GCCGGCGCGG GCGGGGAGAC TTCCGCCAAC GCTTCGGAGC CGCGGCGCGG TCGCGGTCGC
GGTCGTGGCC GCGCGGCACG AGCGCTGGAG GAGCCCGCGG CTGAAGGGGC GGCGGTCGAG
GCGCATGTCG CCGCCACGGT GGAGGAGGGC GAGACCGCCA TCGAAAGCGC GCCGGCAGCG
CCTTTCTTGC CGGCGGAGAC GGTGGTAGAG GCGACCTCCG AGGGCGCCGG GCACGCCGAG
GACGCCGCGG TGGCGGTCGA GCCTCGCGCA GCCGAGGCCG AGCCCGAGTC CGCGCAGCCG
AAGCGCTCGT CCGCGACGCG CAGGCGGCGT GGCGGCAAGG CCGAGGCGCC CGTCCTGCCG
GCCGCCGCTG CGGCCACCGT CGCCGGCGCG GACCATCCGC CTGCTCCTGC CGCGGCGAAC
GAAGCCCACG ACGCCGTGGC CGGCGCGGTG CAGGAGGAAA AGCCGAAGGG CAAAGGCAAG
GGGAAGGACA AGCAGAAGCC GAAGGAGAAA CCCTCCGAGG CCGCAAAGCC GTCTTCGCGG
CGTGGTCGCG GTGGGCGCGC TGCGGGTGCC AAGGCCGCTG CGGAAGGGGC GGGCGGGACG
GCGACCGCGG CGGGCCCGTC GGCGAGCGCC GCGCCCCTGA CGAGCGCCGC GCCCGCCGCG
AGCACGCCCG CGGCGGAGAA CCCCGCCCCG CGTCCGCGCC GCGCGCGCAA GCCGAAGGCC
GCGTCGGCCG AATGA
 
Protein sequence
MASSPDSISM ALFCDFENVA LGVRDAKYEK FDIKRVLERL LLKGSIVVKK AYCDWDRYKS 
FKAAMHEANF ELIEIPHVRQ SGKNSADIRL VVDALDLCYT KSHVDTFVII SGDSDFSPLV
SKLRENAKQV IGVGVKQSTS DLLIANCDEF IFYDDLVRDS QRAAAKREAR DNPPASRKQP
EEDKARREEL ESRRSKAIDI AVETFDALLA ERGESGKIWS SMLKEAIKRR KPDFNESYFG
FRAFGNLLEE AQARGLLELG RDEKSGIFVT RPGPGLAAPR MREEFTMAGE SLVVERAAPR
TEPTVGVAEA EPGAEYASSP AGAGGETSAN ASEPRRGRGR GRGRAARALE EPAAEGAAVE
AHVAATVEEG ETAIESAPAA PFLPAETVVE ATSEGAGHAE DAAVAVEPRA AEAEPESAQP
KRSSATRRRR GGKAEAPVLP AAAAATVAGA DHPPAPAAAN EAHDAVAGAV QEEKPKGKGK
GKDKQKPKEK PSEAAKPSSR RGRGGRAAGA KAAAEGAGGT ATAAGPSASA APLTSAAPAA
STPAAENPAP RPRRARKPKA ASAE