Gene Tmz1t_0697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0697 
Symbol 
ID7083926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp780374 
End bp781549 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content72% 
IMG OID643697723 
Productprotein of unknown function DUF185 
Protein accessionYP_002354365 
Protein GI217969131 
COG category[S] Function unknown 
COG ID[COG1565] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.141855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCAT TGCCCCAGCC TTCCGCCGAT GCGCTCGCCC AGAGCGCTCG CCTGCTCGAA 
CACATCGAGG CCGAGCTGGC CGCGGCCGCT GGCTGGATCC CGTTCGCGCG CTACATGGAG
CTCGCACTCT ACGCGCCGGG GCTGGGGTAT TACAGCGGTG GCGCGCGAAA GTTCGGCCCC
GGCGGCGATT TCATCACCGC GCCCGAGCTT ACCCCGCTTT TCGGCCAGGC GCTCGCCGCC
CAGGTCGAGC AGGTGATGCG CGCGAGCACG CCCGCGCTGA TCGAGGTCGG CGCCGGTACC
GGCCTGCTCG CCGCCGACCT GCTGCTCGAG CTCGAACGCC GCGGCTGCCT GCCCGAGCGC
TACGGCATCC TCGAGCTCTC GGGCGAATTG CGCGAACGCC AGTTCGACAC CCTGGCCGCC
AAGGTTCCTC ACCTGGCAGC GCGCGTGCAT TGGCTGGACG CGCTGCCTGA GCGCTTCTCC
GGTGCAGTGG TGGCCAACGA GGTGCTCGAC GTGATGCCGG TGCATCTGCT GGTGTCGCGC
GCCGAGGGGC TCTTCGAGCG CGGCGTCGCC ATCGCCACCG ATGCCGCGGG GATACGCCGG
CTGTGCTGGG CGGACGTGCC GGCGGCGGGC GCGGTGGCGG AAGGAGCGCG GGCGCTCGCC
CTGCCGGTGC CGCAGAGCGG GGAATACGTC ACCGAGCTGA ACCTCGCCGG CAAGGCCTGG
GTGGCGGCCT GGGCCGAGCG CCTGCACGCG GGCGCCCTGC TGCTGATCGA CTACGGCTAT
CCGCGCGCCG AGTACTACCT GCCCTCGCGT TCGGGCGGCA CCCTGCTGTG CTACTACCGC
CACCATGCCC ACGGCGACCC TTTCCTGTGG CCGGGGCTCA ACGACATCAC CGCCTTCGTG
GACTTCACCG CGGTGGCCGA GGCCGGCTTC GAGGCCGGGC TGGACGTGCA GGGCTACACC
ACGCAGGCGC AGTTCCTCTT CAACTGCGGC GTGCTGGAAT GCCTGGAGCG GCGCGGCGCC
CGCGAGAGCG CGGACTACAT CCGCGCCGCG CGCGCGGTGC AGCGCCTGAC CGCGCCGCAG
GAGATGGGGG AGCTCTTCAA GGTGATCGCG CTGTCGCGCG CGATCGACGG ACCGCTGCTC
GGCTTCGCGC GCGGCGATCG TACGCACGCG CTCTGA
 
Protein sequence
MSSLPQPSAD ALAQSARLLE HIEAELAAAA GWIPFARYME LALYAPGLGY YSGGARKFGP 
GGDFITAPEL TPLFGQALAA QVEQVMRAST PALIEVGAGT GLLAADLLLE LERRGCLPER
YGILELSGEL RERQFDTLAA KVPHLAARVH WLDALPERFS GAVVANEVLD VMPVHLLVSR
AEGLFERGVA IATDAAGIRR LCWADVPAAG AVAEGARALA LPVPQSGEYV TELNLAGKAW
VAAWAERLHA GALLLIDYGY PRAEYYLPSR SGGTLLCYYR HHAHGDPFLW PGLNDITAFV
DFTAVAEAGF EAGLDVQGYT TQAQFLFNCG VLECLERRGA RESADYIRAA RAVQRLTAPQ
EMGELFKVIA LSRAIDGPLL GFARGDRTHA L