Gene Tmz1t_3594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3594 
Symbol 
ID7873099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3942247 
End bp3943302 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content60% 
IMG OID643700534 
ProductAppr-1-p processing domain protein 
Protein accessionYP_002890564 
Protein GI237654250 
COG category[R] General function prediction only 
COG ID[COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000210462 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCAAGC TCACGCAAGG TGATCTGCTG AAGCAGGACG ATGTCGACGC CATCGTGAAC 
ACGGTGAACT GTGTCGGCGT GATGGGCAAG GGCATCGCGC TGCAATTCAA GAACAAGTGG
CCGGACAATT TTGCTGAGTA CGCGGCAGCT TGCAAGGCGG GGCAAGTGCG TCCGGGCCGA
ATGTTCATCC ACGACTCAGG CGGCCTAGTC AAGCCGAACT ACATCATCAA CTTCCCGACC
AAGGACCATT GGCGCGGCGC CTCTAGGATG GCGTTCATCC GCGACGGTTT GATCGACCTA
GTGACGCAGG TGCGGCGCCT CGGCATTCGG TCAATTGCCA TCCCGCCGCT AGGTTGCGGG
AACGGTGGGC TAGACTGGAC CCAAGTGCGG CCTTTGATCG AAGCTTCATT CGAAGCGCTT
CCCGATGTTG AAGTGCGACT CTTCGAACCT GGGGGTGCGC CCAATCCAAA GACGATGGAA
GTTCGGACCA AGCGTCCCCG CATGACGCCC GGCCGGGCAG CAATCGTCAA GGTCTTGAGC
ACGTACGGTG AGCTGAACTA CGGGCTATCC AAGATCGAGG TTCAGAAGCT TGCGTACTTT
CTGCAGGAGG CCGGCGAGCC GCTGCAGCTT CAGTTTGTGA AGCACCACTA CGGTCCGTAC
TCCGACACGC TACGCCACGC GCTGAACACG ATGGAAGGGC ACTTCATTCG CGGCCTGGGC
GATGGTGTTG TCGAAGCCGA AATCGAGCCC ACGGAAGACG CACTTGCCGA AGCCGAGGCG
TTCATCGCAA ACGAAGGCCA TTCGGCGCTC TCAGCCCGTG TTGAGCGCGT GGGGCGGCTT
ATCGATGGCT ACCAATCGTC GTATGGCATG GAACTGTTGG CCTCGGTTCA CTGGGTCGCG
GCACACGAGC CCGGCGTACG CTCGGTCGAT GAAGCGATTA CGGCGGTGCA CGGCTGGAAC
GATCGGAAGA AGCTGCTCAT GCAGCCCGAT CACGTTAAGT TTGCTTGGCA TCGGCTTGCT
GAGGAAGGCT GGCTTTCGTC TAGCGCTTTT CCGTAG
 
Protein sequence
MIKLTQGDLL KQDDVDAIVN TVNCVGVMGK GIALQFKNKW PDNFAEYAAA CKAGQVRPGR 
MFIHDSGGLV KPNYIINFPT KDHWRGASRM AFIRDGLIDL VTQVRRLGIR SIAIPPLGCG
NGGLDWTQVR PLIEASFEAL PDVEVRLFEP GGAPNPKTME VRTKRPRMTP GRAAIVKVLS
TYGELNYGLS KIEVQKLAYF LQEAGEPLQL QFVKHHYGPY SDTLRHALNT MEGHFIRGLG
DGVVEAEIEP TEDALAEAEA FIANEGHSAL SARVERVGRL IDGYQSSYGM ELLASVHWVA
AHEPGVRSVD EAITAVHGWN DRKKLLMQPD HVKFAWHRLA EEGWLSSSAF P