Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0628 |
Symbol | |
ID | 7084566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 706549 |
End bp | 707556 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643697655 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_002354297 |
Protein GI | 217969063 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTCG GCCCCCTTCG CAGCACCCTG CTCACCCTGG CGCTCGCCAC CGGCCTCGCC GGCGTCGCCC ACGCGCAGAC CACCCTGCTC AACGTCTCCT ACGACCCCAC ACGCGAGCTC TACCAGGACT TCAACCCCGA GTTCGCCAAG CACTGGAAAG CCCGCACCGG CGAGACCGTC ACCATCAAGC AGTCCCACGG CGGCGCCGGG AAGCAGGCGC GCGCGGTGAT CGACGGCCTC GAAGCCGACG TTGTCACGCT GGCCCTGGCC TACGACATCG ACGCCATCGC CGAGCAGACG GGCAAGATCC CCGCCGACTG GCAGAAACGC CTGGCCAACA ACAGCTCGCC CTACACATCG ACCATCGTCT TCCTGGTACG CAAGGGCAAC CCCAAGGGCA TCAAGGACTG GGGCGACCTG GTCAAGCCGG GCGTCGAGGT CGTCACCCCC AACCCCAAGA CCTCCGGCGG CGCGCGCTGG AACTACCTCG CCGCCTGGGC CTACGCGCTC AAGCAGCCGG GCGGCAACGA ACAGACCGCG CAGGTCTTCG TCACCGAACT GATCAAGCAC GTGCCGGTGC TGGATTCGGG CGCCCGCGGC GCCACCAACA CCTTCGTCCA GCGCGGCATC GGCGACGTGC TGCTGGCGTG GGAGAACGAG GCCTTCCTGT CGATCAACGA ACTCGGTCCG GACAAGTTCG AGATCGTCGT GCCCTCGATC TCGATCCGCG CCGAACCGCC GGTCACCGTC GTCGATGGCG TGGCGAAAAA GCGCGGCACC GAGAAGCTCG CCCAGGCCTA CCTCGAGTAC CTGTACTCGC CGGTCGGTCA GAAGATCGCA GCCAAGCACT ACTACCGACC GGTCAGGCCC GAACACGCCG ACCCGGCCGA CGTCGCACGC TTCCCGAAAG TCGAGCTGAT CACCATCGAG GACCTCGGCG GCTGGCAGGC GGCGCAGAAG AAGCACTTTG CCGACGGCGG GGTGTTCGAC CAGATCTACG CCAGGTAG
|
Protein sequence | MKLGPLRSTL LTLALATGLA GVAHAQTTLL NVSYDPTREL YQDFNPEFAK HWKARTGETV TIKQSHGGAG KQARAVIDGL EADVVTLALA YDIDAIAEQT GKIPADWQKR LANNSSPYTS TIVFLVRKGN PKGIKDWGDL VKPGVEVVTP NPKTSGGARW NYLAAWAYAL KQPGGNEQTA QVFVTELIKH VPVLDSGARG ATNTFVQRGI GDVLLAWENE AFLSINELGP DKFEIVVPSI SIRAEPPVTV VDGVAKKRGT EKLAQAYLEY LYSPVGQKIA AKHYYRPVRP EHADPADVAR FPKVELITIE DLGGWQAAQK KHFADGGVFD QIYAR
|
| |