Gene Tmz1t_0540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_0540 
Symbol 
ID7085154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp608998 
End bp609897 
Gene Length900 bp 
Protein Length299 aa 
Translation table11 
GC content69% 
IMG OID643697567 
Producthypothetical protein 
Protein accessionYP_002354209 
Protein GI217968975 
COG category[R] General function prediction only 
COG ID[COG1611] Predicted Rossmann fold nucleotide-binding protein 
TIGRFAM ID[TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
[TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.612157 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGA ATTCCCCCCT GCGCGCCCGC AACTTCCCCT CTGCCGAAGA CGAGGCCAAG 
GCCGCGCAGC CGCAGGGCCG CTACGACGGC CCCGGCAGCT CCTTCCGCAT GGCCTTCACC
GACACCGAGT TCCTGCTGCG CGACGAGTTG CGCCCGGTGC GGCTGCAGCT CGAGCTGCTC
AAGGCCGAGC TGGTGCAGCA GGAGCAGGGG GTGGAGTCGA CCGTGGTGGT GTTCGGCAGC
GCGCGCTTCA AGGCGCCGGA CGTGGCCGAG GCGATGCTGC GCGACGCGCT GGCGAGCGGC
GACGAGGCGG CGACCGCGCG CGCGCGTCAG ATGGTGAAGA ACGCGCGTTG GTACGAGGAG
GCGCGCCGCT TCGGCGAGCT GGTCACGCGC GAGTCCGAGG CGCTCGGCGA GCCGGTGATC
GTCGCCACCG GCGGCGGTCC GGGGATCATG GAGGCGGGCA ACCGCGGCGC CTTCGAGGCC
GGCGGGCGCA GCATGGGGAT GAGCATCTTC CTGCCCTTCG AGGAGGCGCC CAACCCCTAC
ATCACGCCCG AGCTGTGCTT CCAGTTCCAC TACTTCGCGA TCCGCAAGAT GCACTTCCTG
ATGCGCGCGG TGGCGCTGGT GAGCTTCCCC GGCGGGCTGG GCACGCTCGA CGAACTCTTC
GAGGTGCTGA CGCTGACGCA GACGCGCAAG ATCCGCCGCC GCCCGATCGT GCTGATCGGG
CGCGACTTCT GGCAGCGCCT GATCGACTTC GACGTGCTGG TCGAGCACGG CGTGATCAGC
CCCGAGGACA AGAACCTGTT CCACTACGCC GAGACCGCCG AGGAAGCCTG GGACGCGATC
AAGGCCGCGT ACAGTGGCGA CAATCCCTCG CTGACGGCGC GGCAGTTGAA GGGCAACTGA
 
Protein sequence
MSKNSPLRAR NFPSAEDEAK AAQPQGRYDG PGSSFRMAFT DTEFLLRDEL RPVRLQLELL 
KAELVQQEQG VESTVVVFGS ARFKAPDVAE AMLRDALASG DEAATARARQ MVKNARWYEE
ARRFGELVTR ESEALGEPVI VATGGGPGIM EAGNRGAFEA GGRSMGMSIF LPFEEAPNPY
ITPELCFQFH YFAIRKMHFL MRAVALVSFP GGLGTLDELF EVLTLTQTRK IRRRPIVLIG
RDFWQRLIDF DVLVEHGVIS PEDKNLFHYA ETAEEAWDAI KAAYSGDNPS LTARQLKGN