Gene Tmz1t_3962 details


Gene Information

Locus tag: Tmz1t_3962
Symbol:
ID: 7873608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 4360778
End bp: 4362196
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 71%
IMG OID: 643700899
Product: peptidase U32
Protein accession: YP_002890922
Protein GI: 237654608
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
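
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is assumed to be a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(4360778, 4362196)
print(length, protein_length(length))  # prints "1419 472"
```

Under these assumptions the result agrees with the Gene Length (1419 bp) and Protein Length (472 aa) fields of the record.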


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGCCT CCCCCCCGGC CCCGCGCCGC CCCGAACTGC TCGCCCCCGC CGGCACCCTC 
GACATGATGC GCACCGCCTT CGCCTATGGC GCGGACGCGG TCTACGCCGG CCAGCCGCGC
TATTCGCTGC GCGTGCGCAA CAACCACTTC GGCCAGCTCG ACACGCTCGC CGCCGGCATG
GCCGAGGCCC GCGCCGCCGG CAAGCTGTTC TATCTCGTCG CCAACATCTA TCCGCACAAC
GCCAAGCTGC GCACCTTCGA GGACGACATG GCGCCGGTGA TCGCGCTGCA GCCCGACGCG
CTGATCATGG CCGACCCCGG GCTGATCCTG ATGGTGCGCG AGCGCTGGCC CGAGCTGCCG
ATCCACCTCT CGGTGCAGGC CAACACTACC AACTACGCCT CGGTGCGCTT CTGGCAGTCG
GTGGGGGTCA AGCGCATCAT CCTGTCGCGC GAGCTGTCGC TGGACGAGGT CGCCGAGATC
CGCGACGCCT GCCCGGACAT GGAGCTGGAA GTCTTCGTGC ACGGTGCGCT GTGCATCGCC
TACTCCGGGC GCTGCCTGCT GTCGGGCTAC TTCAACCACC GCGACCCCAA CCAGGGCAGC
TGCACCAACT CCTGCCGCTG GGACTACAAG CTGCACGAGG CCGCCGAGGA CGCCGCCGGC
GACGTGCAGG CCTGCGGCGG CGCGCCGATC GGCAACCCCA AGGACGCCGG CGCGGTCGGC
ACCGCCACCC GCAGCGCGCT CGACACCGCG CAGGGCCTCG CACTCGGCGG CGGCCCGCGC
CACCTCGGCG GCAGCAAGCT GTGGCTGCTG GAAGAAGGCA CCCGCCCGGG CGAGCGGATG
CCGATCGAGG AAGACGAGCA CGGCACCTAC ATCCTCAACT CGAAGGATCT GCGCGCGATC
GAGCACGTGC AGCGCCTGGT CGAGATCGGC GTCGATTCGC TCAAGATCGA AGGCCGCACC
AAGAGCCCCT ACTACGTCGC CCGCGCCGCC CAGGGCTACC GGCGCGCGAT CGACGATGCG
GTGGCCGGGC GGCCCTTCGA TGTACGCCTG CTCGGCGAAC TCGAGGGCCT GGCCAGCCGC
GGCTACACCG ACGGCTTCTA TCAGCGCCAC TCCACCCCCG AGCAGCAGAA CTACCTGCGC
GGCCATTCCG AATCCGGGCG CAGCCTGCTG GTGGGCGAGG TGGTCGGCTG GGATGCCGCG
CGCGGCCTGG CCGAGGTCGA GGTCAAGAAC GGCTTCGGCG TCGGCGACCG GCTGGAGTTC
GTACAGCCGG GCGGCAACAC CGAGGCGGTG CTGGAGCGGC TGTTCGGCGC CGATGGCGAG
GCGATCCAGC GTGTGCCGGG CAGCGGCCGG CGCGTCTGGC TGGCGCTGCC GGCGGATGCG
GACCCGGCGC GACCCTGCTT CATCGCGCGC TTTCTGTGA
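
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be compared with the fields in the record. The following is a minimal sketch of one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as displayed on the page.
first_row = "ATGCCCGCCT CCCCCCCGGC CCCGCGCCGC CCCGAACTGC TCGCCCCCGC CGGCACCCTC"
seq = clean_sequence(first_row)
print(len(seq))  # prints 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to all rows of the gene sequence would give values to compare against the Gene Length and GC content fields.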
 
Protein sequence
MPASPPAPRR PELLAPAGTL DMMRTAFAYG ADAVYAGQPR YSLRVRNNHF GQLDTLAAGM 
AEARAAGKLF YLVANIYPHN AKLRTFEDDM APVIALQPDA LIMADPGLIL MVRERWPELP
IHLSVQANTT NYASVRFWQS VGVKRIILSR ELSLDEVAEI RDACPDMELE VFVHGALCIA
YSGRCLLSGY FNHRDPNQGS CTNSCRWDYK LHEAAEDAAG DVQACGGAPI GNPKDAGAVG
TATRSALDTA QGLALGGGPR HLGGSKLWLL EEGTRPGERM PIEEDEHGTY ILNSKDLRAI
EHVQRLVEIG VDSLKIEGRT KSPYYVARAA QGYRRAIDDA VAGRPFDVRL LGELEGLASR
GYTDGFYQRH STPEQQNYLR GHSESGRSLL VGEVVGWDAA RGLAEVEVKN GFGVGDRLEF
VQPGGNTEAV LERLFGADGE AIQRVPGSGR RVWLALPADA DPARPCFIAR FL