Gene Tmz1t_2317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2317 
Symbol 
ID7085304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2607541 
End bp2609004 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content69% 
IMG OID643699338 
Productprotease Do 
Protein accessionYP_002355952 
Protein GI217970718 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCCG AAGTCCGCAT GCAAACGAGC GCCGCGAGGG CGCTCGCCGT GGTGCTCTGT 
CTCCTGCTGA TGCTCTGCGC GCCGAGGGGC TTCGCGGCCG GCACGGCAGA GCGGGCGATG
CTCCCCGATT TCACCCGCCT CGTGGCGACA CAGGGCGCCG CCGTGGTCAA TATCAGCGCC
ACCCAGGTGG CTGCGCAGCC GCAGAAGCAA CCCTTCCGTC TTCCGGAACT CGACGAGTCG
GATCCGATGT TCGAGTTCTT CCGCAAGTTC ATCCCGCGCA TGCCCGAGTA CCCGGGGGCG
GAGCCCGACG ACAAGTCGCT CGGCTCGGGC TTCATCATCA GCGCCGACGG TTTCATCCTC
ACCAACGCGC ATGTGGTGGA GGCCGCAGAG AGCATCGTGG TCCGCCTCGC CGACAAGCGC
GAGTTCGACG CCACGGTGAT CGGCGCCGAT GCGCGCAGCG ACGTCGCGCT GATCCGCATC
GAGGCCAAGG ACCTGCCTCA TGTCGTGCTC GGCGACCCCG AGGCGCTCGC GGTGGGCGAG
TGGGTGCTGG CGATCGGTTC GCCCTTCGGC TTCGAGCAGT CGGTCACCGC CGGCATCGTC
AGCGCCAAGG GGCGCAGTCT GCCCGACGAG AACTTCGTGC CCTTCATCCA GACCGACGTC
GCGATCAATC CGGGAAATTC GGGCGGGCCG CTGTTCAACC TGCGTGGCGA GGTGATCGGC
ATCAATTCGC AGATCTACAG TCGTACGGGC GGGTTCATGG GTCTGTCCTT CGCGATCCCG
ATCGATGTCG CGATGGACGT GCAGCAGCAA CTGCGCGAGA AGGGCAGGGT GGAGCGCGGG
CGCATCGGGG TGTCGATCCA GGAGATCACC CGCGACCTCG CCGACAGTTT CGGGCTGCCG
CGCCCTGCGG GTGCGCTGGT GAGCAGTGTG GAGGCCGGTG GTCCGGCGGC GCTCGGCGGC
GTCGTCCAGG GCGACGTGAT CGTGCGCTTC AACCAGCGCA ACGTCGAGAA TTCGGCCGAC
CTTCCGCGCA TCGTCGCGGC GGCGCGCCCG GGCAGCAAGG TCGAGGTCGA GATCTATCGC
GACGGGGCGC CGCGTTCCCT GAGCTTGACG CTGGGCGAAT GGCGCGACCC GGAGGAAGAG
GTCGAGCCCG TGGCGGTCGG TCTGGCCACG GGCGCGACCA ACCGCCTCGG CCTCGAACTC
GTCGCGCCCA CGGCGCAGCA GCGGCGCGAG CGCGGGCTTG CGCACGGCCT GTTGGTGCAG
CGTGCCGAAA AGTCCGCCGC GCGGGCCCAG ATCGTTCCCG GCGACCTCGT GCTGGCGATC
GTCGTCGAAG GGCGCCAGGC CAGGCTCGAT CGCATCGAGG ATTTCGAACG CGTGGTCGCC
GCACTCAAGC CCGGGCAGCA GGTCACCCTG CTGGTCGGGC GCGGCGAGAG CGCGTCCTAT
GTCAGCCTGC GCGCCGACAA GTGA
 
Protein sequence
MNPEVRMQTS AARALAVVLC LLLMLCAPRG FAAGTAERAM LPDFTRLVAT QGAAVVNISA 
TQVAAQPQKQ PFRLPELDES DPMFEFFRKF IPRMPEYPGA EPDDKSLGSG FIISADGFIL
TNAHVVEAAE SIVVRLADKR EFDATVIGAD ARSDVALIRI EAKDLPHVVL GDPEALAVGE
WVLAIGSPFG FEQSVTAGIV SAKGRSLPDE NFVPFIQTDV AINPGNSGGP LFNLRGEVIG
INSQIYSRTG GFMGLSFAIP IDVAMDVQQQ LREKGRVERG RIGVSIQEIT RDLADSFGLP
RPAGALVSSV EAGGPAALGG VVQGDVIVRF NQRNVENSAD LPRIVAAARP GSKVEVEIYR
DGAPRSLSLT LGEWRDPEEE VEPVAVGLAT GATNRLGLEL VAPTAQQRRE RGLAHGLLVQ
RAEKSAARAQ IVPGDLVLAI VVEGRQARLD RIEDFERVVA ALKPGQQVTL LVGRGESASY
VSLRADK