Gene Information |
Locus tag | Nmag_0100 |
Symbol | |
ID | 8822919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 123734 |
End bp | 124945 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003478261 |
Protein GI | 289579795 |
COG category | |
COG ID | |
TIGRFAM ID | |
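The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the listed values are consistent with that) and that the final codon of the unspliced CDS is a stop codon. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Nmag_0100.
length = gene_length(123734, 124945)
print(length, protein_length(length))  # prints "1212 403"
```

Both results agree with the Gene Length and Protein Length fields of the record.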
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGGGTGT TGAATGTCAC TATGACAAAT AGACGCGCGA TTCTCAAAAC CGCAGGTGTG CTCGGGACAG GGTGTCTCGC AGGCTGTCTC GGTGGGGACA GCGACGGGGA GTCGATTCGC CCCGTTGCTA TCGAGGAGTG GCCGCCAGAA TCATACAACA ACGAACTCAA TACCTGGAAC TGGTACGCCG AGTGGAACGA ATGGGGGACC GAAGCGTTTG CCGAGGAGTA CGATCTCGAT AGCTACTCCA CTGAGACATA CGCGTCACCC GGTGATTGGT TTACCAATCT CCAGTCGAAT CCGGAGAATC ACGGAATCGA TCAAATCGGT GCCTTCTCCG AATGGCAGTA TCGGGCCGTT GAAGAGGACC TGTTAGAACC GATTCCTATC GATATGATGC CGAACGTAGA GATTCCCGAC CAGTACCTCG ACGTCCACCG GGAGCAGTTC TGGAGCGACG ACGGTGCAGG TGGACTCTAC GGTATCCCTC ACTCAATCGT GATCAGCCCG ATCGTCATGT ACAACACGGA GGAATTGCAG GACCCAGGCG AATCGATCGA CATCCTCTGG GACGAGGAGT TTGCCGACGA AATCTCCATG ATCGCTCACC AACCAGCGAT CCTCTGTGAG GCTGGTGCAC TTTACACCGG TCAGGATCCG ACCGATCCGG ACGATTTCGA AGAGATTCAG GAGGTACTCG AACAACAGCG CGACCTCGTG TTCACCTACG CCGATGACCA CCAGACACAG ATGCAACTCG TGATGAGCGG TGATGCAACG CTCGGTTCAC ACATGGACGG CCGAGCGTTC AGGGCGATAT ACAACCATGG CGGTGACGTC GACTGGTTTA TTCCGGAGGA GGGTGCGACC TGGGGGACGG ATACGCTCGT TATCCCGAAA AACGCGCCAA ACCCGGTTAC GAGCACGATG TATCTGGACT ACCTCTTCAG CGACGAAGGG ATGGAGCAGC TGATCGACAC GTCCCTGTAT CGTCCACCGG TTGCCAACGA CGAATTCACT GACGGTGAAC TCGGAGAGAT AATTCGCGAA AACTGGACCG ACGAGTGGGA GAAAGAAGGC GATGCCGAGG ACTTCATCGA CGACCTCGTG TTGACAGAAG AAACAATGGA CAATCTGTAC CACAACTGGC CCCGTTCAGA CGAAGTCATC GAGCGATACG ACGAGATCTG GACGGCAGTT ACCGCGGGAT AA
Protein sequence | MGVLNVTMTN RRAILKTAGV LGTGCLAGCL GGDSDGESIR PVAIEEWPPE SYNNELNTWN WYAEWNEWGT EAFAEEYDLD SYSTETYASP GDWFTNLQSN PENHGIDQIG AFSEWQYRAV EEDLLEPIPI DMMPNVEIPD QYLDVHREQF WSDDGAGGLY GIPHSIVISP IVMYNTEELQ DPGESIDILW DEEFADEISM IAHQPAILCE AGALYTGQDP TDPDDFEEIQ EVLEQQRDLV FTYADDHQTQ MQLVMSGDAT LGSHMDGRAF RAIYNHGGDV DWFIPEEGAT WGTDTLVIPK NAPNPVTSTM YLDYLFSDEG MEQLIDTSLY RPPVANDEFT DGELGEIIRE NWTDEWEKEG DAEDFIDDLV LTEETMDNLY HNWPRSDEVI ERYDEIWTAV TAG
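The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one plain way to strip the spacing and compute a GC fraction. The record does not state how its GC content value was derived, so this is an assumed method rather than the database's own, and the helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used only as a short demonstration;
# paste the full Gene sequence field to evaluate the whole gene.
demo = "ATGGGGGTGT TGAATGTCAC"
print(len(clean_sequence(demo)), round(gc_fraction(demo) * 100))
```

Applied to the full Gene sequence field, `len(clean_sequence(...))` should match the Gene Length field if the sequence was copied completely.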