Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3201 |
Symbol | |
ID | 8826064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 3317694 |
End bp | 3319610 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003481313 |
Protein GI | 289582847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTCGA TGTCCGGAGA CGCTGACACC GGCTACAGCC GTCGCTCGCT CCTCGCTGCC GGCGCGACTG GCCTCTCGAT TGCTGCCAGC GGTTGTATCG ACCGTGTCCA GAGCGTCGTC GACCCTGATG CCTCCGAACA GCTCTCACTG TCTATCCTCT CGCTCCCCGC TGCCGACGGT GACAGAGAGA CCGCCGAAAT CGCACGTCAT CTGGAGTCCA ACCTCGAAGC GGTCGGCATC AATGCGACGA TCAGCACTCG CTCCGAGTCC GAGTTCCTGG AGGCGATTCT GATCGACCAC GACTTCGACC TCTACGTGGG CCGCCACCCG GCTGACTTCG ACCCGGACTT TCTGTACGAG ACGCTGCACT CCACCTACGA GTACGACCGA GGCTGGCAGA ACCCGTTTGG CTTCACCGAC CCGCTGTCGA TCGATCCGTT GCTCGAGCGC CAGCGCAACG AAACTAGCGC CAATCGCGAG CGGACTGTCA CGGCGCTCCT CGAGGCAATT GCACAGGTGA AACCGTTCGA ACCGATCTGC GTGTCGAACG AGTACCGCGC CGTCCGAACT GATCGGTTCG ACGGCTGGGA CGAGACCCAC CTCGGAACCG GACGGGGCTA CCTCGGACTC GAACCTGACG CTGCGTCCGA CGCCGAACAA CTTCGTGCGC TCGTCACGGA CCAGCGGCTG ACCCAGACCT TCAACCCGCT CTCCCCGACG ACGCGCGAGC AGGGGGTCAT CGTCGACCTG CTGTACGACT CGCTTGCCGT TCCAACGTGG TCCGTTCCCG ACGAGGCTGA GGCAACGGAC GAGGAGACGC GAGCGGGTGC AGAAGTGATC GACGCGGCAA ACGTCTCGAT CGAACCCTGG CTCGCGACCG ACTGGGAGTG GGACGGTGAG GACCGAACCA TGACCGTCGA TATTCGCGAG GACTGTCTGT TCCACGATCA CAGCGAGGAC GGCGACGACA CTGACGCAGA CACCGGCGAT AGCAACGGTA ACGATAGCGC CATCCCCCTC ACCGCCGAGG ACGTCAGGTT CACCTATCGA TTCCTCGCAG ACACGACGTA CGGACGCACC TCGAGTGCTT CCCCCGCACC ACGCTATAAG GGCCATGCAA GCGCAGTCGA CTCGGTCACC GTCGAAGACG ACTACCGAGT CACAATTTCG TTCTCGACGG ATCGGGCCGT CGCCGAGCGT GCGTTGACGG TCCCGATCCT CCCATCCGGC TCTGACTCAC CCTGGATCAC CCATCTCGAG AACAGCGTCG AGTCCACAGA CGACGACTGG TCGCCGACGC AGGGTGACTG GAGCATCGTG ACGACCGGTC ACGCGCCACC GACAGGTAGC GGTCCCTACA AGTATGACTC CCACGACGCG GGAGAGTCCG TCACGCTCAC GCGCTTTGAG GATCACTTTA CGCTCCGGGA CGACGATCAC GACCTGCCAG CGCCTACCGC CGACGAACTT CACTTCGAGG TCGACCCGGG AACGAGTTCG TCGATCGAAC GGGTCGCAAG CGGCGACGCC GATATCACGA CATCGGTACT CGATACCTAC GCACTCGAGT CGACGGTCGA CGACGACGCT GTGTCGTTCG TCGAGTCGCC GTCGTGGTCG TTCTATCACG TCGGCTTCAA CACGCGCAAC ACGCCGTGTG GCAGCCCCAA CTTCCGTCGT GCGGTCTCGC AACTGATCGA CAAGCAGTGG ATCGTCGACG AGGTGTTCGC TGCCGAGGAC AGTGCACGGC CCGTCGCGAC ACCGGTTGCC GAAGAGTGGA CGCCCGACTC GCTCGCCTGG GACGATGGCG AGTTGGTGAC GACGTTCGTC GGCGAGGACG GGGAACTCGA CGTCGACGCC GCGCGAGAAC TGTTTGCATC ACACTTCGAG TACGACGACG GAGCGCTGGT CCGGTAG
|
Protein sequence | MNSMSGDADT GYSRRSLLAA GATGLSIAAS GCIDRVQSVV DPDASEQLSL SILSLPAADG DRETAEIARH LESNLEAVGI NATISTRSES EFLEAILIDH DFDLYVGRHP ADFDPDFLYE TLHSTYEYDR GWQNPFGFTD PLSIDPLLER QRNETSANRE RTVTALLEAI AQVKPFEPIC VSNEYRAVRT DRFDGWDETH LGTGRGYLGL EPDAASDAEQ LRALVTDQRL TQTFNPLSPT TREQGVIVDL LYDSLAVPTW SVPDEAEATD EETRAGAEVI DAANVSIEPW LATDWEWDGE DRTMTVDIRE DCLFHDHSED GDDTDADTGD SNGNDSAIPL TAEDVRFTYR FLADTTYGRT SSASPAPRYK GHASAVDSVT VEDDYRVTIS FSTDRAVAER ALTVPILPSG SDSPWITHLE NSVESTDDDW SPTQGDWSIV TTGHAPPTGS GPYKYDSHDA GESVTLTRFE DHFTLRDDDH DLPAPTADEL HFEVDPGTSS SIERVASGDA DITTSVLDTY ALESTVDDDA VSFVESPSWS FYHVGFNTRN TPCGSPNFRR AVSQLIDKQW IVDEVFAAED SARPVATPVA EEWTPDSLAW DDGELVTTFV GEDGELDVDA ARELFASHFE YDDGALVR
|
| |