Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1330 |
Symbol | |
ID | 7084451 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1469795 |
End bp | 1470844 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643698347 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002354985 |
Protein GI | 217969751 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCGTC GCTTGAGCCT GCATCGCCTC CCCTGGCTTG TGCTCGCGCT CGCACTCTCC GGCACGGCCT GCGCGACCGA GGTGTTGCGC GTGCTGAGCT GGCCGGGCTA TGCCGACGCC GACGTGGTGC AGGCCTTCGA GGCCCGCACC GGCGCCCGGG TCGAAGTCAC CCAGGTCGAT TCCGACGAAA CGCTGTGGCA GAAGCTCAGC ACGAACGACG CGACCGACTA CGACGTGTTC GCGGTCAATA CCGCCGAGCT GCAGCGCTAC ATCGACCGCG GCGTGGCCGT GGCGATCGAT CCGGCCGCGC TGCCCAACCT CGGTGCGCAA TTGCCGCGCT TCCGCAACCC GGCAACGCTG CCCGGCACGA CTCGCGATGG CAAGCTGTAC GCCATCCCCT ATGCCTGGGC CGAAATGGGC CTGATCTACG ATCGGCGCCA GTTCGACGCG CCACCCCAGT CCATCGCCGC GCTGTGGGAC GCCCGCTACC GCGGCAAGGT GCTGGTGTAC AACAGCGGCT CGCACAACTT CTCGCTCGCC GCACAGATGC TGGGCAAGGC ATCGCCCTTC CGCCTCGACG CCGCCGACTG GGCGCCGGCG GTCGAGCGTC TGGTCGAGTT GCGCCGCAAC CTGCTGACCT TCTACGCCCA GCCGGAAGAA TCCGCGCATC TGTTCGTCAG CCGCGGCGCG GCGCTGATGT ACGCCAATTA CGGCACCCAG CAGCTGCAGC TCCTGCGCGC AGCGGGGGCG GACGTGGGCT ATGCGATCCC GCGCGAAGGC GCGCTCGCCT GGCTCGACTG CTGGGTGGTG ACGCGCGGCG CACGTAACCA GGCGCTCGCG CTGGCGTGGA TCGACCACCT GCTCGGCACC GGCCCGGCGC ACGTGCTGAG CGCGCGCCAC GGCCTCGACA ACACCCGCGA CCCGGCGCCG CACCAGGCCG AAACCGACCG CCTGGTCTGG CTCGAACCGG TCGAGGACGT CGAACGCCGC AACCTGCTGT GGGAGCGCAT CCTCTCCGGC GACCGCGGCG CACGGGTGCT CGCGCCATGA
|
Protein sequence | MLRRLSLHRL PWLVLALALS GTACATEVLR VLSWPGYADA DVVQAFEART GARVEVTQVD SDETLWQKLS TNDATDYDVF AVNTAELQRY IDRGVAVAID PAALPNLGAQ LPRFRNPATL PGTTRDGKLY AIPYAWAEMG LIYDRRQFDA PPQSIAALWD ARYRGKVLVY NSGSHNFSLA AQMLGKASPF RLDAADWAPA VERLVELRRN LLTFYAQPEE SAHLFVSRGA ALMYANYGTQ QLQLLRAAGA DVGYAIPREG ALAWLDCWVV TRGARNQALA LAWIDHLLGT GPAHVLSARH GLDNTRDPAP HQAETDRLVW LEPVEDVERR NLLWERILSG DRGARVLAP
|
| |