Gene Information |
Locus tag | Nmag_3873 |
Symbol | |
ID | 8826743 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013923 |
Strand | - |
Start bp | 270262 |
End bp | 271653 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003481976 |
Protein GI | 289583566 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
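The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are hypothetical and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(270262, 271653)
print(length)                              # 1392
print(expected_protein_length_aa(length))  # 463
```

Both values agree with the Gene Length (1392 bp) and Protein Length (463 aa) fields of this record.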
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0964469 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAATG ATACGAAAGA TCATGGAAAT GGCGCAACGC AGCGTGCAAC GTCTCGGGAC ACGTACCGTC GACGAACTGT CCTGAAGGGT GCAACGACGG GGGCGACGAT CGGTGCACTG TCGGTGGCTG GTTGTCTGGG CGGCGGGAAC GGCGGCCCAG TGTTGCGAGT CATCAACTCG GCGTACCAGC AACAAGAGGA CGAGTACCGT GCAATCTTCG ACGAGTTCGA GGAGGAACAC GATTGTGAGG TGGAGTACAC CCGATCCGAT TTCGCATCCG CGCCATCGGA AGCCGCACAG GCACAGGCCG GCGGCAATCC GTACGACCTG CTGATGCTCG CCTCGCCGGG GAACAACGTG TTCGGCGTGC AGGAAGGGCT GTACCAGCCG ATCAACGACG TCATCGAGGA CATGGGTGCA GAGGACCACT GGCGAGAGGA GTTCCTCGTC CAGATGGACG GCGATTACTA CTTCGCGCCG AACACAGGGA CAGTTTCGAC GCTCATCTAC CGCGAGGACC TGTTCAACGA GTACGATGCG CCGATGCCGC CATTTGACTC GTGGGACGAA TACCACACCG CCGCCGAGAC GATGACCGAC GAGGACGAAA ACCTGTACGG GAGCCCGGTC TTCCTCGGGA GCAACCACTT CCACGGAATC CTGCCCCTGT CTCTGCTTCA CGGCCGAGGC GGTTCGGTCA TCAACACAGA CGACGAGGTC GTCTACGACT CCGAGGAGAC GGTCGAGATG CTCGAGTTCA TGCGGGACCT GAACGAATTC AGTCCACAGG CGGCCCACGG TGCTGACATT CCGGAAATGC GTCCGCCGCT GTACCAGGGG ACGTACGCGA TGACGTGGTA CTCCACCAAC GTCATTCCGT ACGACATCGA AGAATACAAC CCGGACCTGA CTGGAGACGT GCAGGTTGCG CCGATTCCAG CGTACGACTC GAGTTACGAG CCGGTTGCGC GGCTGACCGG CCTGGGCCAC GGACTCGGTG CCGAGACCGA ACATCCGGAG CTAGCGAAGG ACCTGTTGCG AGAGATCACC TCGTTCGAAG GCGTCATGCG ATTGATGACC GCCCAGCCGG CGAGTCACGT TCCGGCGATT CACGGCATAC TCGAGGAGGA CGACCTGTGG GAGACGGACG TTATGCAGGA CTACGAAGAG CACTACCGGG ACCTCGTCGA CATCGCAGAC GAGTACGGTC GGGTCGTCGC CGTCGAGGAA AACGAGGGTC ACATCAACCC GGTGACGGGG CAGGCCGTCG CCGAAACACA CGTTATCAGT TCCGTTCAGG ACGTGATCTT GGAGGACGAA GATCCCCAGG ATGCAGCCGA ACATTGGGCG GACGAAATTC GCGATGACTA CGAAGATCAA CTGAACGTCT AG
|
Protein sequence | MGNDTKDHGN GATQRATSRD TYRRRTVLKG ATTGATIGAL SVAGCLGGGN GGPVLRVINS AYQQQEDEYR AIFDEFEEEH DCEVEYTRSD FASAPSEAAQ AQAGGNPYDL LMLASPGNNV FGVQEGLYQP INDVIEDMGA EDHWREEFLV QMDGDYYFAP NTGTVSTLIY REDLFNEYDA PMPPFDSWDE YHTAAETMTD EDENLYGSPV FLGSNHFHGI LPLSLLHGRG GSVINTDDEV VYDSEETVEM LEFMRDLNEF SPQAAHGADI PEMRPPLYQG TYAMTWYSTN VIPYDIEEYN PDLTGDVQVA PIPAYDSSYE PVARLTGLGH GLGAETEHPE LAKDLLREIT SFEGVMRLMT AQPASHVPAI HGILEEDDLW ETDVMQDYEE HYRDLVDIAD EYGRVVAVEE NEGHINPVTG QAVAETHVIS SVQDVILEDE DPQDAAEHWA DEIRDDYEDQ LNV
|
| |
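The protein sequence above follows from the gene sequence by codon-wise translation. The following is a minimal Python sketch of that step, assuming the gene sequence is already given in coding orientation (it starts with ATG and ends with TAG, although the gene lies on the minus strand of the replicon). The `translate` helper and the table construction are illustrative and not part of the source database. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, paired with the
# amino acids they encode (same assignments in tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(gene_sequence: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in this record can be
    pasted in directly. Ambiguous bases (e.g. N) raise KeyError in this sketch.
    """
    seq = "".join(gene_sequence.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGGTAATG ATACGAAAGA TCATGGAAAT"))  # MGNDTKDHGN
# Last five codons of the gene sequence, including the TAG stop.
print(translate("CAACTGAACGTCTAG"))                   # QLNV
```

The two fragments reproduce the first ten residues and the last four residues of the protein sequence listed in this record.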