Gene Information |
Locus tag | Nmag_1789 |
Symbol | |
ID | 8824629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 1821828 |
End bp | 1822931 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003479925 |
Protein GI | 289581459 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.463605 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGCAA AAAACACGCA CGGGAGACGG ACGTTCCTTC GATCGACGGC GGCAGCGGGG AGTGTCGCCG CCCTCGGGGG GCTCGCAGGC TGTACAGGAA TGCTCGACGG TGGGGACGAC ACGCTGACCG TTGCGGTCTA CGGCGGTGTG TTCCAGGACG TTATGGACGA GGACCTCTTT GCTCCGTTCG AAGAGGAAAC CGACATCAGC GTCGAGTCAG AGGCACAACC AACATCCGAG GAAGCGCTCA CGCAGTACGA GAACGCCGTT GGTGCCGGCG ATGCGCCAGT CGACGTCGCG ATCATGGCAC AGACCGGTGT CCTTCAGGGA CTAAACTCCG ATCTGTGGCA CATCTGGGAC GACGACGAGT TCGAGAATCT CGAGTACATC AGCGACGATC TCGTCGGTGA GGCCGACGGC GGCATCTCGA GTATCGGTGC GCTGTCGTGG TACATCAACC TCGTCCAGAA TACGGACGTC ATCGAGGAGC CAATCGATTC CTGGGAGGCG CTCTGGGACG ACGAGTACGA AGATACGCTC GGCCTGCTCG GCTACGCGTC GAACTCGTTC CTGCTCGAGG TCACCGCAGA AGTGCACTTC GACGGCCAGG ACATTCTCGA CGACCGCGAC GGCGTCGAGG AAGTGTTCGA GGAACTCGAG GGCGTCACGG ATCAGGCGAA CTTCTGGTAC GAGAACGAAG CGGAGTTCCA GCAGCGTCTC CGAGACGGCG AAGTGCCGGC CGGCATGCTC TACAGTGACA TTACGCGAGT CATGCAGGAC GACGGCGCGC CAGTTCAGTC GAACTTCGTC CAGGAAGGAT CGATTCTCGA CTCCGGGCTC TGGGTCACAC TCGAGACGTC CGACCTCAAG GAGGAGGCGC GCGAGTTCAT CGACTACGCG AGTCAGCCCT CGGTGCAGGA CGAACTCGCA CAGGGACTGT ACACGAGTCC GACGGTCGAA CGTGAGTACT CCGAGATCGA CGACGACTTC TACGAGGAGG TCGCCGGACC AGGACCGGAC GAAGCGATCA CGCCCAAGTA CGAACTCTAC GTCGAGGAAG AGGACTGGGT TAGTGAGCGC TGGGAACAGT TCATCATCGG CTAA
|
Protein sequence | MPAKNTHGRR TFLRSTAAAG SVAALGGLAG CTGMLDGGDD TLTVAVYGGV FQDVMDEDLF APFEEETDIS VESEAQPTSE EALTQYENAV GAGDAPVDVA IMAQTGVLQG LNSDLWHIWD DDEFENLEYI SDDLVGEADG GISSIGALSW YINLVQNTDV IEEPIDSWEA LWDDEYEDTL GLLGYASNSF LLEVTAEVHF DGQDILDDRD GVEEVFEELE GVTDQANFWY ENEAEFQQRL RDGEVPAGML YSDITRVMQD DGAPVQSNFV QEGSILDSGL WVTLETSDLK EEAREFIDYA SQPSVQDELA QGLYTSPTVE REYSEIDDDF YEEVAGPGPD EAITPKYELY VEEEDWVSER WEQFIIG
|
| |
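The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence above ends in TAA). Under those assumptions the record's 1104 bp and 367 aa values agree with each other.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage, ignoring the spaces that group bases in blocks of 10."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(1821828, 1822931)  # Start bp, End bp from the record
    print(length)                  # 1104
    print(protein_length(length))  # 367
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content; the function strips the block spacing first, so the field can be pasted as-is.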