Gene Tmz1t_0090 details


Gene Information

Locus tag: Tmz1t_0090
Symbol:
ID: 7083473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 102550
End bp: 104247
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 71%
IMG OID: 643697137
Product: protein of unknown function DUF342
Protein accession: YP_002353786
Protein GI: 217968552
COG category: [L] Replication, recombination and repair
COG ID: [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain
TIGRFAM ID:
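
The start, end, and length values in this record agree with each other, and a short sketch can confirm it. It assumes the coordinates are 1-based and inclusive (the reading under which the listed gene length matches) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(102550, 104247)
print(length, protein_length(length))  # prints: 1698 565
```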


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAAGC AGCCCACGGA AGGCCCGGAA CTGGAGGTGT CGTTCGACGA ACTCACGCGC 
GTCCTGAGCG TGAGCATCGC CCATGACCCG CATTTTCCGC GCATCGACGC CTTGTGGCTG
CGCCGACGGC TCGAAGCCGC GGGCTACGCC GACCTGCAGA TCCGCCCCGA CCCGATCCGC
CGCCTGATCG CCCAGTACAA CGCCGGCGAG GCGGTGGCGC CGGTCGAGAT CGCGCAGTGC
GTGGATGCGT CGATGCAGAT CGGAGTCTCG CTGGACGGAC TGGTCGCACG CCTGAGCATC
GTGCCGGCCA AGGGTGGCAA GCCGGCGAGC AAGACCGAAC TGCTCGCCCT GATCGAATCG
CGCGGCATCG TCGAGGGCCT GCTGCTGGAG GAGATGAACC GCGCCATCGC CGACGGTCAG
GCCGACGACC TCGCCATCGC GCGCGGCCGG GAACCGGAAC CGGGCAAGGA CGGCTGGCTC
GAGTACCTGC TGCCGGAGAC GCGCGAGCGC GTACCCAGCG TGCGCCCCTG CGGCCGCACC
GACTACCGCG ACCTCGGCGA GATCCTCGTC GTCCACGCCG GCGATGCGCT GATGCGCCGC
CATCCCCCGC AGCCCGGAGT CGACGGCGTC AATGTGTACG GCCGGCCGAT CGTGGCGCGG
CGCGGTCGCG AGCAGCGCTT CGCCCCCGGC CTGCGCGGCA CCGCGATCTC GCCCGAGGAC
CCCGAGCTGC TCGTGGCTGC CTGCGACGGC CAGCCGGTGC GGGTGCGCAA CGGCGCGATG
GTGGAGCCGA TCTTCACCGT GGATGCGGTC AACCTCGCCA CCGGCAACAT CGACTTCGAC
GGCAGCGTGC GCATCCGCAA CGATGTGCAG GCCGGGATGA CGGTGCGCGC CAGCGGCGAC
ATCGAGGTCG GCGGCGTGGT CGAGCCGGCC ACGCTGGAGG CCGGCGGCAG CATCGTGGTC
AAGGGCGGCG TGCTCGGCGG GCTGGGCGGC AAGACCGCCG GCAAGGATTA CAGCGCGCAC
GCGATCCGCT GCGAGGGCAG CTTCTGCGCC ACCTACGCGC AGCAGGCGCG CATCAGCGCG
GGCGACTCGA TCTTCATCGA CGACGTCGCC ATGCAGTGCC AGCTCGAGGC ACGCAACCAC
ATCCGCGTAG GCAAGCGCCT GCGCGGCCAG ATCGTCGGCG GCCATTGCCG CGCCAGCCTG
TCCATCCACG CGCGCACGAT CGGCGCGAAC AGCCGCATCC GCACCGAGCT CGAGATCGGC
ATGGACAACG GGCTGGAACA CGCCATCCAG GAGAAGGCCG AGGCGCGCGA CGCACTCGAG
AACCGCTTGC TCGAGATCGG CAAGATGCTC ACCTTCGCCG ACCGCCATCC CGATCGCGTG
ACACCCGAGA TGCTGGGACG TGCCGAGCAG ACGGCGAGCG CGCTGTCGGG CGAGATCGAG
AGCTTGCGCA GCGAGGAGGA GGATCTGCAG CACCGCCTCG CGCTCACCCG CGAGGCACGG
GTGAATGCGG AGCGCGAGAT GTTCGAGGGC TGCATCGTGC GCATGGGCGA GCAGCTGTTC
AAGCTGTCGC AGGACCGCGG GCCGACCACG GTGCGACTGG CCACCCAGGG GCTGGGCGTG
TTCCCGCTCG AGGACGACAG CCGCTTCGAC GAGCCGCAGC GCCCGGCGGC GGCCAGCCAG
GGCTCGTCCC GGCGCTGA
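
The gene sequence is shown in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, the GC content field could be recomputed from the sequence text as follows. The helper name `gc_percent` is hypothetical, and the example runs on only the first two blocks of the sequence, not the whole gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = cleaned.count("G") + cleaned.count("C")
    return 100.0 * gc_count / len(cleaned)


# First two blocks of the gene sequence only (20 bases, 12 of them G or C).
print(gc_percent("ATGGACAAGC AGCCCACGGA"))  # prints: 60.0
```

Passing the full sequence text, line breaks included, to the same function gives the value for the whole gene.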
 
Protein sequence
MDKQPTEGPE LEVSFDELTR VLSVSIAHDP HFPRIDALWL RRRLEAAGYA DLQIRPDPIR 
RLIAQYNAGE AVAPVEIAQC VDASMQIGVS LDGLVARLSI VPAKGGKPAS KTELLALIES
RGIVEGLLLE EMNRAIADGQ ADDLAIARGR EPEPGKDGWL EYLLPETRER VPSVRPCGRT
DYRDLGEILV VHAGDALMRR HPPQPGVDGV NVYGRPIVAR RGREQRFAPG LRGTAISPED
PELLVAACDG QPVRVRNGAM VEPIFTVDAV NLATGNIDFD GSVRIRNDVQ AGMTVRASGD
IEVGGVVEPA TLEAGGSIVV KGGVLGGLGG KTAGKDYSAH AIRCEGSFCA TYAQQARISA
GDSIFIDDVA MQCQLEARNH IRVGKRLRGQ IVGGHCRASL SIHARTIGAN SRIRTELEIG
MDNGLEHAIQ EKAEARDALE NRLLEIGKML TFADRHPDRV TPEMLGRAEQ TASALSGEIE
SLRSEEEDLQ HRLALTREAR VNAEREMFEG CIVRMGEQLF KLSQDRGPTT VRLATQGLGV
FPLEDDSRFD EPQRPAAASQ GSSRR
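
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below uses the standard codon assignments, which translation table 11 shares for elongation; it does not handle the alternative start codons that table 11 allows, which is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate_cds` are illustrative.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cleaned), 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 21 bases and the final 18 bases of the gene sequence.
print(translate_cds("ATGGACAAGC AGCCCACGGA A"))  # prints: MDKQPTE
print(translate_cds("GGCTCGTCCC GGCGCTGA"))      # prints: GSSRR
```

Both fragments match the corresponding ends of the protein sequence listed in this record.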