Gene Information |
Locus tag | Tmz1t_0901 |
Symbol | |
ID | 7084759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 993208 |
End bp | 994296 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643697924 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002354564 |
Protein GI | 217969330 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
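The coordinate and length fields above are consistent with one another, and the arithmetic can be checked directly. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is the only reading that matches the listed gene length) and that the final codon is a stop codon that adds no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 993208
end_bp = 994296
protein_length_aa = 362

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon, which contributes no amino acid to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1089
print(remainder)                # 0
print(expected_protein_length)  # 362
```

Both derived values agree with the Gene Length (1089 bp) and Protein Length (362 aa) fields.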
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.457949 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACGCTGA AGTTCGGCTG GATGGCGGTG CTGGCCCTGG CCGCCGGGGC GGCGGTGGCG GCCGAGGAGC CCATCCTCAA CGTCTACAAC TGGAACGACT ACATCGCCGC CGACACCATC CCCGCCTTCG AGAAGGCGAG CGGGATCGAG GTGCGCTACG ACCTCTACGA CAGCAACGCC ACGCTGCAGG GCAAGCTGCT GACCGGCGGC AGCGGCTACG ACGTTGTGTA CCCGAGCGTG GAGTACGCCG GCAAGCAGAT CCAGGCCGGC ATCTTCCAGC CACTCGACAA GTCGCGCCTG CCCAATCTCG TCCATATCGA CCCGCTGATC CTCGAGGCGG TGGCGGTCGC CGACCCCGGC AACCGCTACC TCGTACCCTA CATGTGGTTC ACCATCGGGG TGGCGATCAA CGTCGACAAG GTCATGAAGG CGCTCGACGG CAAGCTGCCC GACAACGCCT GGGACCTGCT CTTCGACCCT GCGCTGACCG CGCGCCTGAA GGGCTGCGGC ATCGCGCTGA TGGACGAGGC CAGCGACGTG ATCCCCGCGG CCATGCTCGA CGCCGGCCTC GACCCGGTGA AGATGGACCC GGCCGACATC CGCGCCGCGG TCGAGCACAT CCGCCCGGTG CGCAAGGACA TCCGCAGCTT CAACACCGCG CCCATCGAGC AGATGGCCAA GGGCAGCCTG TGCGTGGCGA TGATGTTCTC GGGCGACGCC CGCATCGCCG CGCGGCGCGC GCAGATCGGC GGCTCCGAGG TCAGGCTGCA GTACCTGATC CCGGCCGTGG GCGCGATGAT GTCGGTGGAC GTGATGGCGA TCCCGCGCGA CGCGCCGCAC CCGCATAACG CCCACCGCTG GATCGACGCG ATGATGGACC CGGCCACGGT GGCGCGCATC TCCAACGAGA CCTTCTACAT CAGCGCCAAC ACCGCGGCGC TCGCGCTCAC CGACGCCACG CTGCGCGACG ACCCGGCGAT CAACGTGCCG GAGGCGGTCA AGCGTACGCT GCGCGCCAAG CCGGTGCTCG GCAAGGACGT GCAGCGCGAG CTGACCCAGG CGCTGAATCG CTTCAAGGCG GCGCGCTGA
Protein sequence | MTLKFGWMAV LALAAGAAVA AEEPILNVYN WNDYIAADTI PAFEKASGIE VRYDLYDSNA TLQGKLLTGG SGYDVVYPSV EYAGKQIQAG IFQPLDKSRL PNLVHIDPLI LEAVAVADPG NRYLVPYMWF TIGVAINVDK VMKALDGKLP DNAWDLLFDP ALTARLKGCG IALMDEASDV IPAAMLDAGL DPVKMDPADI RAAVEHIRPV RKDIRSFNTA PIEQMAKGSL CVAMMFSGDA RIAARRAQIG GSEVRLQYLI PAVGAMMSVD VMAIPRDAPH PHNAHRWIDA MMDPATVARI SNETFYISAN TAALALTDAT LRDDPAINVP EAVKRTLRAK PVLGKDVQRE LTQALNRFKA AR
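The GC content field (69%) is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows. The helper name `gc_percent` is hypothetical, and the example runs on only the first three 10-base groups of the gene sequence, so its result is not expected to equal the whole-gene figure.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Keep only nucleotide letters; this drops the spaces between the
    # 10-base groups used in the record's sequence layout.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First three 10-base groups of the gene sequence, spaces included.
fragment = "ATGACGCTGA AGTTCGGCTG GATGGCGGTG"
print(round(gc_percent(fragment), 1))  # 60.0
```

Passing the full Gene sequence string to the same function would give the whole-gene value to compare against the reported 69%.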