Gene Tmz1t_0005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0005 
Symbol 
ID7085103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp7271 
End bp8407 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content54% 
IMG OID643697055 
Productrestriction modification system DNA specificity domain protein 
Protein accessionYP_002353704 
Protein GI217968470 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAGT TTTTCGATGT AACCCTTGGC GAGGTCGTCG ATTTTTTCAA TGGCAAGGCC 
ATCAAGCCGG GTCAGGACGG AGAGTATCCA GCGTATGGCT CGAATGGGCT GATCGGAGGC
GCACCGGACT GGAAGTATGA AAACTCCATC ATCATCGGGC GCGTGGGGGC GTACTGCGGT
TCGGTTGCAT ACTGCAAGAG TCGGTTCTGG GCTTCTGATA ACACGATCGT GGCAAGGCCC
AAGAGCGGGG ATGTCGGGTA TTTCTACTAT CTCCTGAAAG CACTGGAACT CAACCGTTAT
GCCGGAGGTG CGGCGCAGCC ACTTGTCACA CAAACGGTTC TAAAAGGTGT TCCTGCAAGA
GTTCCTGACA TCCCAACCCA GCGCCGCATT GCCTCCATCC TGTCCGCCTA CGACGACCTG
ATCGAAAACA ACACGCGACG GATCGCCATC CTTGAGGAAA TGGCCCGGAG AATCTACGAG
GAGTGGTTCG TCCGCTTCCG TTTTCCGGGG CATGAACAGG TGAAGATGGT GGAGTCTGAG
CTGGGGTTGA TCCCGGAGGG GTGGAAGGCG ACGAATATCG GAGAGGTTGC CGAGAATCAC
GATAGAAAGC GCAAACCTTT ATCGAAGATG CAGCGGGAGA AGTTCAAGGG GCCATATCCG
TACTATGGCG CTGCAAAAAT CTTTGACTAC GTTGAGGATT ACATTTTTGA TGGGCGATTC
GTCCTCATGG CAGAAGACGG TAGCGTCATC ACCCCCGATG GATTTCCCGT TCTTCAGTTG
GCCAATGGGA GATTCTGGGC GAATAACCAT ACGCACATTT TGCGCGGAAC GCCGGATGCA
TCGACTGAGT TTATTTACCT CAGACTGTCT TCGCAAAAGG TAAGTGGCTA CATAACCGGA
GCTGCACAGC CGAAGATCAC ACAGGCAAAC ATGAATCGAA TACCGGTTTG TCTGCCGCCG
CGAGACTTGA TGGCGCGATT TACGGAATTG GTGGGGCCGA AGTTCGATCT CATCGACTGC
TTGGAAAGGA AACACACCAA TCTCAGAGCT ACCCGAGACC TCCTGCTCCC CAAGCTGATC
TCCGGCGAAC TCGACGTTTC CACCCTGCCC GAACCTGAGG AGGCCATCGC GGCATGA
 
Protein sequence
MNEFFDVTLG EVVDFFNGKA IKPGQDGEYP AYGSNGLIGG APDWKYENSI IIGRVGAYCG 
SVAYCKSRFW ASDNTIVARP KSGDVGYFYY LLKALELNRY AGGAAQPLVT QTVLKGVPAR
VPDIPTQRRI ASILSAYDDL IENNTRRIAI LEEMARRIYE EWFVRFRFPG HEQVKMVESE
LGLIPEGWKA TNIGEVAENH DRKRKPLSKM QREKFKGPYP YYGAAKIFDY VEDYIFDGRF
VLMAEDGSVI TPDGFPVLQL ANGRFWANNH THILRGTPDA STEFIYLRLS SQKVSGYITG
AAQPKITQAN MNRIPVCLPP RDLMARFTEL VGPKFDLIDC LERKHTNLRA TRDLLLPKLI
SGELDVSTLP EPEEAIAA