Gene Information |
Locus tag | Tmz1t_0540 |
Symbol | |
ID | 7085154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 608998 |
End bp | 609897 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643697567 |
Product | hypothetical protein |
Protein accession | YP_002354209 |
Protein GI | 217968975 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1; [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
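The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 608998
end_bp = 609897

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS of N bases holds N / 3 codons; the final codon is the
# stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 900 299
```

Both results match the reported Gene Length (900 bp) and Protein Length (299 aa).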
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.612157 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCAAGA ATTCCCCCCT GCGCGCCCGC AACTTCCCCT CTGCCGAAGA CGAGGCCAAG GCCGCGCAGC CGCAGGGCCG CTACGACGGC CCCGGCAGCT CCTTCCGCAT GGCCTTCACC GACACCGAGT TCCTGCTGCG CGACGAGTTG CGCCCGGTGC GGCTGCAGCT CGAGCTGCTC AAGGCCGAGC TGGTGCAGCA GGAGCAGGGG GTGGAGTCGA CCGTGGTGGT GTTCGGCAGC GCGCGCTTCA AGGCGCCGGA CGTGGCCGAG GCGATGCTGC GCGACGCGCT GGCGAGCGGC GACGAGGCGG CGACCGCGCG CGCGCGTCAG ATGGTGAAGA ACGCGCGTTG GTACGAGGAG GCGCGCCGCT TCGGCGAGCT GGTCACGCGC GAGTCCGAGG CGCTCGGCGA GCCGGTGATC GTCGCCACCG GCGGCGGTCC GGGGATCATG GAGGCGGGCA ACCGCGGCGC CTTCGAGGCC GGCGGGCGCA GCATGGGGAT GAGCATCTTC CTGCCCTTCG AGGAGGCGCC CAACCCCTAC ATCACGCCCG AGCTGTGCTT CCAGTTCCAC TACTTCGCGA TCCGCAAGAT GCACTTCCTG ATGCGCGCGG TGGCGCTGGT GAGCTTCCCC GGCGGGCTGG GCACGCTCGA CGAACTCTTC GAGGTGCTGA CGCTGACGCA GACGCGCAAG ATCCGCCGCC GCCCGATCGT GCTGATCGGG CGCGACTTCT GGCAGCGCCT GATCGACTTC GACGTGCTGG TCGAGCACGG CGTGATCAGC CCCGAGGACA AGAACCTGTT CCACTACGCC GAGACCGCCG AGGAAGCCTG GGACGCGATC AAGGCCGCGT ACAGTGGCGA CAATCCCTCG CTGACGGCGC GGCAGTTGAA GGGCAACTGA
Protein sequence | MSKNSPLRAR NFPSAEDEAK AAQPQGRYDG PGSSFRMAFT DTEFLLRDEL RPVRLQLELL KAELVQQEQG VESTVVVFGS ARFKAPDVAE AMLRDALASG DEAATARARQ MVKNARWYEE ARRFGELVTR ESEALGEPVI VATGGGPGIM EAGNRGAFEA GGRSMGMSIF LPFEEAPNPY ITPELCFQFH YFAIRKMHFL MRAVALVSFP GGLGTLDELF EVLTLTQTRK IRRRPIVLIG RDFWQRLIDF DVLVEHGVIS PEDKNLFHYA ETAEEAWDAI KAAYSGDNPS LTARQLKGN
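The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch and not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
prefix = "ATGAGCAAGA ATTCCCCCCT GCGCGCCCGC"
print(translate(prefix))  # MSKNSPLRAR
```

The printed peptide matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and the complete protein sequence.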