Gene Tmz1t_1777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1777 
Symbol 
ID7085747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1998619 
End bp1999725 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID643698799 
ProductRieske (2Fe-2S) domain protein 
Protein accessionYP_002355425 
Protein GI217970191 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.111763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGACA TCGCTTCCAA GGCTCGACTG GCCCAGGCAG CCTCCCAGCT GCCTGTCTCG 
TGGTACTTCG ACCAGCAGGT TTTCGAACTG GAGAAGAAAC TCCTCTTCGA TGCCGGTCCG
GGGTACGTCG GCCATGAGCT GATGGTTCCC GAGGTCGGCA ACTACCGCTC GCTGGAGTGG
CTCGACCACG CCAAGCTGCT GTTCCGCACC GAGGCGGGTG TGCACCAGAT GTCCAACGTC
TGCCGCCACC GCCAGGCGAT CATGCTGCAG GGCAGCGGCA CGACCAAGCT CGTGGTCTGC
CCGGTGCACC GCTGGACCTA CGACCGCCAG GGCAACCTGC TCGGCGCACC GCACTTCCCC
GAGAAGCCCT GCCTGGGCCT CAGGCGCGAC GAGCTCGAAC GCTGGAATGG CCTGCTTTTC
AAGGGCCCGC GCTCGGCGAG CGCCGACCTT GCCGGGATGC AGGTGGCGGG CGAATTCGAT
TTCTCGGGCT ACAAGCTCGA CAAGGTCGAG GTCCACCATT GCAATTACAA CTGGAAGACC
TTCATCGAGG TCTATCTGGA GGACTATCAC GTCGTGCCCT TCCACCCCGG ACTGGGCAAC
TTCGTGACCT GCAAGGACCT GAGCTGGCAG TTCGGCGACT GGTATTCGGT ACAGAAGGTG
GGCATCACCT CGCTCGCCAA GCCGGGCTCG GAGACCTACG CCAAGTGGCA CAAGGCGGTG
ATCGACTACT ACGGCGAGAA GAAGCCGACG CATGGTGCGA TCTGGCTCAC CTACTACCCG
AATGTGATGG TGGAGTGGTA CCCGCACGTG CTGGTGGTGA GCACGCTGAT TCCGACGGAT
GTGGATAAGA CGACCAACGT GGTGGAGTTC TACTACCCGG AGGACATCGT CGAGTTCGAG
CGCGAGTTCG TCGAGGCCGA GCAGGCCGCC TACATGGAGA CTGCGATCGA GGACGACGAG
ATCGGCGAGC GCATGGATCG CGGCCGGAGG GCGCTGCTGA AGGAGGGGCG CAACGAGGTC
GGCCCCTACC AGTCGCCCTT CGAGGACGGC ATGCAGCATT TCCACGAGTT CTACCGGCGC
ATCATGGAGC CGCACATCGG CGGGTGA
 
Protein sequence
MSDIASKARL AQAASQLPVS WYFDQQVFEL EKKLLFDAGP GYVGHELMVP EVGNYRSLEW 
LDHAKLLFRT EAGVHQMSNV CRHRQAIMLQ GSGTTKLVVC PVHRWTYDRQ GNLLGAPHFP
EKPCLGLRRD ELERWNGLLF KGPRSASADL AGMQVAGEFD FSGYKLDKVE VHHCNYNWKT
FIEVYLEDYH VVPFHPGLGN FVTCKDLSWQ FGDWYSVQKV GITSLAKPGS ETYAKWHKAV
IDYYGEKKPT HGAIWLTYYP NVMVEWYPHV LVVSTLIPTD VDKTTNVVEF YYPEDIVEFE
REFVEAEQAA YMETAIEDDE IGERMDRGRR ALLKEGRNEV GPYQSPFEDG MQHFHEFYRR
IMEPHIGG