Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_2372 |
Symbol | |
ID | 8825224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 2418413 |
End bp | 2419603 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003480496 |
Protein GI | 289582030 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAAAAT TCACACGACC GGTATCCAGG AGAGGGGTTT TAACGAGTGG TGCAACTGCA GTCGGAGTGG CGGTTGCTGG TTGTATAAGT GACGATACAG ACGGAGCCGG CCAGAGCGGC GAGTCTGGGG GTGGATCGGG CAATGGATCC AGCGAAGAAA CGTACGAGGT TGGATACGGA GACTATCGAA CGACAGTAAA CGCGTCGGCG TTTCCGGACG AACTACGAAT TTACGCGGTC CAAACCGGTT GGTCGAATTG GGACGCCGTA ATGGAGAACT TCGAAAGCGA GTACGGTGTT CCCCTCTACG ACGCACAGGG ATCGTCTGGC GAGGCACTCA CCGACGCACG GTCAAACGCC GGTAATCAGA CACATTCAGC GTTTAACGGC GGCTACTCGT TCGCTCTCGA GGCGATGAAC GATGGCCTGA CGACGGATTA TAAGCCCGCC AACTGGGACG TGGTCCCTGA CGAACTCAAA ACCGACAATG GTCACGTCGT TTCGACTCGA CAGATGACGA CAGCGGTCAC CTACCGTGTT GACATTTATG AGGAACGCGG TCTCGACGCA CCCGAGACCT GGGAAGACCT CAAACACCCA GACATCGCAC AAGATCTGGC CTTCACGCCA CCTCATACAG CTAATGGACT TGCGTCGGCA CTGTCGGTCA ATAGAGCCTA CGGCGGTTCG ATGGCGAATC TAGATCCTGT TATCGAGTAT CACGAGGAAA TCGCCGACCA CGGCGCAGAC ATTCGTCGAA ACATCGAGGG AGACGTTACC AGCGGCGAGA TATCGACCGT CATTGAGTAC GATTACTCGG GACTGAACAT GAAGTACAAC ATGGATGAGA TCGACGAGGA ACAACTCGAG GTCGCAATAT TGACCGGTCC GAGCGGCAGG GAGGGGGCGA TGAACGTTCC GTACGGGTTT GGACTGCTCG AGGGGGCACC AAATCCCGAG GCGGCGAAGT TGTTCATGGA CTACGTGCTC TCGCTAGAGG GTCAGGAGCT GTTCTTCGAC GCGTTCGTCC GCCCGATTCG GGCCGACGAA CTCGAGCAAC CCGAGGAATT CCCCGATCAG TCCGACTACG ACGCAGCCGA GTTCGCCCTC GATCAGGAGG AACTGGTCGC AAACCAGGAG TCGATCCAGC AGGAACTCAC CGAACGAACC CCGCTACCGG GCGCACAGTA G
|
Protein sequence | MAKFTRPVSR RGVLTSGATA VGVAVAGCIS DDTDGAGQSG ESGGGSGNGS SEETYEVGYG DYRTTVNASA FPDELRIYAV QTGWSNWDAV MENFESEYGV PLYDAQGSSG EALTDARSNA GNQTHSAFNG GYSFALEAMN DGLTTDYKPA NWDVVPDELK TDNGHVVSTR QMTTAVTYRV DIYEERGLDA PETWEDLKHP DIAQDLAFTP PHTANGLASA LSVNRAYGGS MANLDPVIEY HEEIADHGAD IRRNIEGDVT SGEISTVIEY DYSGLNMKYN MDEIDEEQLE VAILTGPSGR EGAMNVPYGF GLLEGAPNPE AAKLFMDYVL SLEGQELFFD AFVRPIRADE LEQPEEFPDQ SDYDAAEFAL DQEELVANQE SIQQELTERT PLPGAQ
|
| |