Gene Tmz1t_1192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1192 
Symbol 
ID7083852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1321627 
End bp1322667 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content74% 
IMG OID643698208 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_002354847 
Protein GI217969613 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.978581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTGC TCGGCATCGA AACCTCCTGC GACGAGACCG GCGTGGCGAT CTACGACACC 
GACACCGGCC TGCGCGCGCA CTGCCTGCAC TCGCAGATCG ACCTGCACGC GGCCTACGGC
GGGGTGGTGC CCGAGCTCGC CTCGCGCGAT CACATCCGCC GCCTGCCGCT GCTGCTGCGC
CAGACCCTGG CCGCGGCCGG CCTCGCCGCC GCGGACATCG ACGCCGTGGC CTACACCAGC
GGTCCCGGGC TCGCGGGCGC GCTGCTGGTG GGGGCGAGCG TGGCGGAGTC GTTCGCGATG
GCGCGCGGGA TCCCGGCGCT GCCGGTGCAT CACCTCGAGG GCCACCTGCT GTCGCCGCTG
CTGTCGGCCG ATCCTCCGGC CTTCCCCTTC GTGGCCTTGC TGGTTTCGGG CGGCCACACC
CAGCTGATGC GGGTGCGCGG GGTGGGCGAC TACGCGCTGC TCGGCGAGTC GGTGGACGAC
GCCGCCGGCG AGGCCTTCGA CAAGACCGCC AAGCTGCTCG GCCTGGGCTA TCCCGGCGGG
CCGCAGCTCG CCGGGCTGGC CGAATCCGGC GTGCCCGGCC GCTTCCGCCT GCCGCGGCCG
ATGCTGCACT CGGGGGACCT CGACTTCAGC TTCAGCGGTC TCAAGACCGC GGTGCTCAAC
GTGGTGTCGG CGCCCGACTG GGACCCGGCG CGCATGGCCG ACCTCGCTGC CGAGTTTCAG
CAGGCCGTGG TCGATGTGCT GTGCGCGAAG GCGCTCGCGG CGCTGAAGAA GGTGGGGCTG
AAGACCCTCG TAGTGGCCGG CGGGGTGGGT GCCAACCGCT GCCTGCGCGC CACCCTGGAC
GCCGCGCTCG CGCGCCGCGG CGGGCGCGTG CATTACCCCG AGCCGGCGCT GTGCACCGAC
AACGGCGCCA TGATCGCCTT CGCCGGCGCC TTGCGCCTGG CGGCCGGCGA GTCCGTGCCG
GAAGTCTGCG CCGTCCGCAT CCGCCCGCGC TGGCCCATGG TCGAACTGCG CCCGCCGGTG
CAGGCGCCCG CCATCCTGTA G
 
Protein sequence
MKVLGIETSC DETGVAIYDT DTGLRAHCLH SQIDLHAAYG GVVPELASRD HIRRLPLLLR 
QTLAAAGLAA ADIDAVAYTS GPGLAGALLV GASVAESFAM ARGIPALPVH HLEGHLLSPL
LSADPPAFPF VALLVSGGHT QLMRVRGVGD YALLGESVDD AAGEAFDKTA KLLGLGYPGG
PQLAGLAESG VPGRFRLPRP MLHSGDLDFS FSGLKTAVLN VVSAPDWDPA RMADLAAEFQ
QAVVDVLCAK ALAALKKVGL KTLVVAGGVG ANRCLRATLD AALARRGGRV HYPEPALCTD
NGAMIAFAGA LRLAAGESVP EVCAVRIRPR WPMVELRPPV QAPAIL