Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3203 |
Symbol | |
ID | 8826066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 3320597 |
End bp | 3322522 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003481315 |
Protein GI | 289582849 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGACC CCGCCCCGTC AACGAGCCCC GACAACCACA CACGCGACAG CGACGACGCC GCTCGCGGCG GTCCCAGTCG CCGGTCGATG CTCGCCGCGA CAGCCGGTCT CACCGTCTCG ATGAGCGGCT GTATCCGCGA AGTACGAAGC ATCGTCAACC GCGACGAGGT TGCCCCACTC TCACTGACGA TTTCGACGGT TCCCGCCGAC GGCGACCGAG AGAGTATCCA GCTCGCCCGC GCCGTCGGCT CGGCACTCGA GTCCGTTGGC GCCGACATCG CGATCGATAT GCTGTCTCAA GAGGAGTTTC AGCGTGCGAT CCTCGTCAAT CATGACTTCG ATCTCTACGT CGGTCCCCAT CCCGGGGACA CGACGCCTGA CTTCCTTTAT GAACTCCTGC ACTCGCGATT CGCGGAGGAA GCCGGCTGGC AGAATCCCTT CGGACTGACG AATCTCGCTA TCGACGACCG GCTGGACGAG CAGCGTGTCG CCACAGGCGT AGCCGCAGAC GCAGACGGGA ACGGGAACGG GAACGGGAAC GGAGACGAAG GCGAAAATGG AGACGAAAAC GAGACCAGCG ACCGCGAGGA CGCGATCACG GCAACACTCG AGACGGTTGC GCGCGAGCAG CCGTTCGTCC CGATTTGTCG GCCGACGGAG ATCAGACTCG CGCGCTCGGA TCGATTCCAG GGCTGGGGAG AGGGGCATCC AGCGACTCGA TTTGGCTATC TTGGTCTCGA GCCACAGTCT GATGGGGCGT CTGCGGAGCT TCGGGCAGTA CACACGGACG CACGGCCGTC ACAGAATCTG AATCCGCTGT CGGCGGAGTA TCGGGAGCGT GGTCTCTCGA CCGAGTTGCT GTACGACTCG CTTGCAGTGG CGCAGTGGCG TGGTGCTGGC GACCGTGAGT CGGAGACCGA CACCGCAACC GATGTCAGTC CCTGGCTCGC GAGTGACTGG GAGTGGAGCG ACGACGACAC CCTCCTCGTC ACCCTCCGGG AGGACTGTGA ATTCCACGAC GGCACGCGAC TCACTGCCGA GGACGTCGCC TTTACCTACG AGTTCCTCGC GGACACGACC CGCGGCAGTC GGCGCTTTGC CGCACCAGCC CCCCACTACC GCGGCCGCGT CGCCGCCGTC GACGACGTGA CCGACCTGAG CGAGACCGAA CTCGAGTTCA CCGTCGCAGC GAGCCAGCCG GTCGCCGAGC GCGCACTGGA GGTGCCGATC CTCCCGAAAC ACATCTGGGA CGAGCGCACG GACCAGGCGA GCCTCCGCGG CGTCACGGTT TCGGAGGGGA CGACCGAAGC GCTCGTCACC GACAACGTGC CCGCAATCGG CAGCGGCCCC TATCAGTTCG ACGAGCGGAC CGACCGTGAT CACCTTACGC TCGCACGCTT TGACGCACAC TTTACGCGCC GCGACGGCGT CGATCTGCCG GAACCGACAA GCGAGCACCT GCGCATCCAG ATCGACCCGC GGAGCACGTC CGCGATCGAA CTCGTCGAAA GCGACAGTGC CGACGTGACG TCTTCGCCAC TCGAGTCCTA CGTCGTCGAT GACGTGTTCG AAGACGACGA GGCAATTTCG GAAACGACGG TACTCGAGTC ACCATCCTGG TCGTTCTACC ATATCGGGTT CAACACGCGT CGAGCGCCGC TTTCCAATCC GCGGTTTCGC AGGGTCGTTG CCCGACTCCT CGACAAGGAA CAGTTGGTCG AGGAGGTGTT CCACGGCCAT GCACAGCCGA TCGCAACGCC GGTGGTCGAG GAGTGGGTTC CCGACTCGCT CGCCTGGAAC GGCAGCGATC CGGAGACGCC GTTTTTCGGG ACTGACGGGG AGCTCGATGT CGCGGCTGCG AGGGATGCGT TCGAAGATGC GGGACTGCGC TACGACGATG ACGGAAGACT CCGGGTGAGA CACTGA
|
Protein sequence | MSDPAPSTSP DNHTRDSDDA ARGGPSRRSM LAATAGLTVS MSGCIREVRS IVNRDEVAPL SLTISTVPAD GDRESIQLAR AVGSALESVG ADIAIDMLSQ EEFQRAILVN HDFDLYVGPH PGDTTPDFLY ELLHSRFAEE AGWQNPFGLT NLAIDDRLDE QRVATGVAAD ADGNGNGNGN GDEGENGDEN ETSDREDAIT ATLETVAREQ PFVPICRPTE IRLARSDRFQ GWGEGHPATR FGYLGLEPQS DGASAELRAV HTDARPSQNL NPLSAEYRER GLSTELLYDS LAVAQWRGAG DRESETDTAT DVSPWLASDW EWSDDDTLLV TLREDCEFHD GTRLTAEDVA FTYEFLADTT RGSRRFAAPA PHYRGRVAAV DDVTDLSETE LEFTVAASQP VAERALEVPI LPKHIWDERT DQASLRGVTV SEGTTEALVT DNVPAIGSGP YQFDERTDRD HLTLARFDAH FTRRDGVDLP EPTSEHLRIQ IDPRSTSAIE LVESDSADVT SSPLESYVVD DVFEDDEAIS ETTVLESPSW SFYHIGFNTR RAPLSNPRFR RVVARLLDKE QLVEEVFHGH AQPIATPVVE EWVPDSLAWN GSDPETPFFG TDGELDVAAA RDAFEDAGLR YDDDGRLRVR H
|
| |