Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3500 |
Symbol | |
ID | 7873006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3837388 |
End bp | 3838326 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700440 |
Product | periplasmic solute binding protein |
Protein accession | YP_002890471 |
Protein GI | 237654157 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.639001 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTCC CCTACCGCTT GCGCCGCCTG CGCATCCTCG CGGCCTGCGC CGCCCTGCTC CTGGGCGGCA GCGCCCACGC CCAGGCGCTC GAAGTGGCGA CCAGCTTCAG CATCCTCGGC GACCTCGTCG CCCAGGTCGG CGGCGAGCGC ATCAAGGTGC GCACCCTGGT CGGTCCCGAC GAGGACGCCC ACGCCTTCCA GCCGCGTCCC TCGGACGCGC GCGAGATCGG CAAGGCTGCG CTGGTGGTGG TCAACGGCCT CGGCTTCGAC GACTGGATGA CGCGGCTGGC GCGCGCCGGC GGATTCAAGG GGGCGGTGGT GGTGGCGAGC GCGGGGATCT CGACGCTGGA GATGAGGCGC GACGACGGGC ACGACCATGG CCACGATCAT GGCAAGGGCA AGGCGGTCGA CCCGCACGCC TGGCAGGACG TGGCGAACGT GCGCCGCTAC GTCGCCAACA TCGCCGCCGC CCTCGCCACC GCCGACCCGG ACGGCGCGGC CATCTACCGC GCCGCGGCCG CACGCTACGA CGGCGAGTTG CAGGCGCTCG ACGCGGAGAT CCGTGCCGCC TTCGCGGCGC TGCCCGCCGA GCGCCGCAAG GTGGTGAGTT CGCACGCGGC CTTCGGGTAC TTCGCGCGCG CCTACGACAT CCGCTTCCTG TCGCCCGTCG GCGTGGCGAA CAACGCCGAG CCCACCGCCA AGGGCGTGGC CGGCCTGATC CGCCAGCTCG CGGCGGAAAA GGTGCCGGCG GTGTTCATCG AGAACATCGC CGATCCTCGC CTGATCGAGC GCATCCGCAG CGAAAGCGGC GCGGTCGTGG GCGGCACGCT GTATTCCGAC GCACTGTCGA AGGCTGACGG CCCGGCGCCC AGCTACGTAC GCATGATGCG CGCCAACCTG GCGACGCTGC AGAAGGCACT CGCGGCACCC GAGCGCTGA
|
Protein sequence | MRFPYRLRRL RILAACAALL LGGSAHAQAL EVATSFSILG DLVAQVGGER IKVRTLVGPD EDAHAFQPRP SDAREIGKAA LVVVNGLGFD DWMTRLARAG GFKGAVVVAS AGISTLEMRR DDGHDHGHDH GKGKAVDPHA WQDVANVRRY VANIAAALAT ADPDGAAIYR AAAARYDGEL QALDAEIRAA FAALPAERRK VVSSHAAFGY FARAYDIRFL SPVGVANNAE PTAKGVAGLI RQLAAEKVPA VFIENIADPR LIERIRSESG AVVGGTLYSD ALSKADGPAP SYVRMMRANL ATLQKALAAP ER
|
| |