Gene Tmz1t_2301 details

Gene Information

Locus tag: Tmz1t_2301
Symbol:
ID: 7085286
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thauera sp. MZ1T
Kingdom: Bacteria
Replicon accession: NC_011662
Strand:
Start bp: 2587687
End bp: 2589534
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 71%
IMG OID: 643699320
Product: peptidase M61 domain protein
Protein accession: YP_002355936
Protein GI: 217970702
COG category: [R] General function prediction only
COG ID: [COG3975] Predicted protease with the C-terminal PDZ domain
TIGRFAM ID:
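
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a stop codon that is not counted in the protein length; both assumptions agree with the listed values but are not stated in the record. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(2587687, 2589534)
print(length)                     # 1848
print(protein_length_aa(length))  # 615
```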


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.9294
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGCCC GTTCCACCCG CTTGTCCGCC GCCGCGTCGG CGCTCCATGG CCGTCCGCCG 
CCCGCGGTCG AATACCGCAT CCGCCCGGCG AACCCCGGCG CCCACCTCTT CGAGGTGAGC
TGCACGGTGG CCGAGCCCGA CCCCGCGGGG CAGGTCTTCA GCCTGCCGGC GTGGATCCCG
GGCAGCTACA TGATCCGCGA GTTCGCGCGC AACATCGTGC GCCTGCGTGC CGAGGCCGAC
GGCGAGCCCT GCGCGCTGGA GAAGCTCGAC AAGCACACCT GGCGCGCGGC TGCGGTGCCG
GGCGCGCGCG TGCTGAGCGT GCATTACGAG GTCTATGCCT GGGACCTGTC GGTGCGCACC
GCCCATCTCG ACACCACACA CGGCTTCTTC AACGGCACGA GCGTCTTCCT CGCGGTGGCG
GGGCGTACCG AAGCACCTTG CGTGGTGACC ATCGAACGCC CCGAGGGCGA CTGCGGGCGC
GACTGGAAGC TCGTCACCGC GCTGCCGCCC GAGCACGGCC ACCCGGGCCA GGCCTGCCGC
TTCGGCCGCT TCCGCGCCGC GGACTACGAC GAGCTCATCG ACCACCCGGT CGAGATGGGC
CGTTTCACGC TGGCACGCTT CGAGGCCGCG GGCGTGCCGC ACGACATCGC CCTCACCGGC
CGCCACGACT GCGACCTCGA GCGCCTGTGC GCCGACCTGC GCCGGGTGTG CGAATGGCAG
ATCGCGCTCT TCGGCACGCC GGCGCCGGTG GACTATTACG CCTTCCTGAC CATGGTGGTG
GGAGAGGGCT ACGGCGGGCT GGAGCATCGC GCCTCGACCG CGCTGATCTG CAGCCGCGCC
GAGCTGCCGT GGAAGGGCAT GGAGGGTCTG CCCGACGGCT ACAAGAGCTT CCTCGGCCTG
TGCAGCCACG AGTATTTCCA CACCTGGAAC GTCAAGCGCA TCAAGCCGGT GGCGTTCACG
CCCTACGATC TCGCGCGCGA GAACCACACC CGGCTGCTGT GGGCCTTCGA GGGCTTCACC
TCGTATTACG ACGACCTCGC CCTGGTGCGC AGCGGCGTGA TCGGCATCGA TGACTACCTG
GGGCTGCTCG GCAAGACCAT CGCCAACGTC TTGCGCGGCA GCGGCCGGCT CAAGCAGAGC
GTGGCGGAGT CCTCCTTCGA CGCGTGGACC AAGTACTACC GCCAGGACGA GAACGCACCC
AATGCCATCG TCAGCTACTA CGCCAAGGGT GCGCTGATCG CGCTCGCGCT CGACCTGCAG
CTGCGCGCGG GCAGCGAGGG TGCGGCCAGC CTGGACGACG TGATGCGGCT GCTGTGGCGG
CGCCACGGCC TCACCGGCGT GGGCGTGCCG GAGGATGGCA TCTTCGCCGC GGTGCGCGAC
GCGGGCGGCG AACGCCTCGG CGCGCGCCTG GCGAAATGGC TGCAGAAGGC GGTGGACGGC
TGCGAGGATC TGCCGCTGGC GCGCCTGCTG CGTCCCTTCG GCGTGAGCCT GCGCGCCGAG
GCGGCGGGGA CCGCGCCGGT GCTCGGGATG AAGCTCGGCG GGGGCAGTGG CGAGGCGAAG
GTCGCCAATG TGTACGACGA CGGTCCGGCG CAGGCGGCGG GCGTCTCGGC CGGGGACGTG
CTGATCGCGC TCGACGGGCT GAGGATCTCC AGCGCCAAGG GGCTGGAGGA TCTGCTCGCC
CGTCGTGGTG CGGGCGACGA GGTGGAACTG CATCTCTTCC GTCGCGACGA GCTGATGAGC
TTCCGTGCGG TGCTCGCTGC ACCGCCTGCC GAGCGCCAGG AGCTCAAGCT GGCGCCGCGC
GCCGATAGCG CGGCAGCGAA GCTGCGGCGG GGTTGGTTGG GGGGGTGA
 
Protein sequence
MSARSTRLSA AASALHGRPP PAVEYRIRPA NPGAHLFEVS CTVAEPDPAG QVFSLPAWIP 
GSYMIREFAR NIVRLRAEAD GEPCALEKLD KHTWRAAAVP GARVLSVHYE VYAWDLSVRT
AHLDTTHGFF NGTSVFLAVA GRTEAPCVVT IERPEGDCGR DWKLVTALPP EHGHPGQACR
FGRFRAADYD ELIDHPVEMG RFTLARFEAA GVPHDIALTG RHDCDLERLC ADLRRVCEWQ
IALFGTPAPV DYYAFLTMVV GEGYGGLEHR ASTALICSRA ELPWKGMEGL PDGYKSFLGL
CSHEYFHTWN VKRIKPVAFT PYDLARENHT RLLWAFEGFT SYYDDLALVR SGVIGIDDYL
GLLGKTIANV LRGSGRLKQS VAESSFDAWT KYYRQDENAP NAIVSYYAKG ALIALALDLQ
LRAGSEGAAS LDDVMRLLWR RHGLTGVGVP EDGIFAAVRD AGGERLGARL AKWLQKAVDG
CEDLPLARLL RPFGVSLRAE AAGTAPVLGM KLGGGSGEAK VANVYDDGPA QAAGVSAGDV
LIALDGLRIS SAKGLEDLLA RRGAGDEVEL HLFRRDELMS FRAVLAAPPA ERQELKLAPR
ADSAAAKLRR GWLGG
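
Both sequences are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The sketch below is one way to do that; the helper names `join_blocks` and `gc_fraction` are hypothetical, and the short input string is only the first two blocks of the gene sequence, used for illustration.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First two blocks of the gene sequence, as printed in the record.
fragment = join_blocks("ATGTCCGCCC GTTCCACCCG\n")
print(len(fragment))                   # 20
print(round(gc_fraction(fragment), 2))
```

Applying the same two functions to the full gene sequence gives a length and a GC fraction that can be compared with the Gene Length and GC content fields.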