Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3648 |
Symbol | |
ID | 8826516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013923 |
Strand | + |
Start bp | 32255 |
End bp | 33925 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003481758 |
Protein GI | 289583348 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAGG AAGACACAAC ATCCGACATC GATCGACGCC GATTCATCAA ATCGACGGCT GCTATCGGCG CAGCGGGGCT CTTCGCGGGG TGCGTTGGCA GCGACCCAGA CGAGCCAGGA GACGGGAGTG GGACGTTCCG GATCGCGACC TCTGACGAGG TCGAGACGTT CGATCCGCGA ATGAATCAGA TGGCGTGGTA CAGCACGGCT GCACACTACC TGTTCGACTC ACTCCTCATG ATGGAGCCCG ACGGCAGTGG CACCGTTCCA CACCTCGCCG ATGGCGAACT CGAGGAGGTC GACGAAACGA CGTTCGTTGC AGATATCCGT GATGACGTCA CCTTCCACAA CGGCGACCAA CTAACCGCAG AGGACGTGGC GTACTCGCTC AACTGGGTCC GTGACCCGGA CAACGACTCG CCCAACCTCT CGAACGTCGA GTTCCTCGAC GAAGCCGAGG CAACCGGCGA GTTCGAAGTC ACGCTTCATC TCGAGTACCA GTCTGCACTG ATGGAGCGAA CACTCGCCGG CATGAACGCC GCGATTGTGC CGATGGATGT CGCAGAGGAG ATGGGTCAGG AGGAGTTCGG TCAGAACCCA ATCGGCAGCG GACCGTTCGA ACTCGCGAAC CACGATCCGT CGGCCAGCGT CGAACTCACG GCCAACGAAG ACTACTTCCT CGGTGAGCCA GCGCTAGATG GACTCGAGTA CCGAGTCATT CCGGAGACGG AGGTCGGGTA TGTCGAACTT TCAACCGGTG ACATTCACCA GTCCGGTGTG ACGGAGGCGT TGCTCGACGA GGCACAAGAA GACGACAATA TCGACGTCTA TCAGCTCGAC AACTTCGACT TCCAGGGGTT CCTCGTGAAC TGTCTGGACG GGCCGTTCGA GGATGTGCGA GCGCGTGAGG CACTGCAGTA TCTCGTCGAT TACGACGAGC TGTTGGTCGG TGCGGTCGGC GAACTCGGCA GCCGGAACTG GGGACACATG CCACAGGGTG TCATCGACGC CTGGGACTTC CCTGCAGACG AGTGGGAGGA GCAGTACTAT CCCGAGCAGG ACCACGACCG CGCAGTTGAA CTCTTCGAGG AGGCCGGTCT TGGCACCGAC TTTGACGTCG AGATTACCTC GCAGTCGGGC GAAGCGACGA CTGGACGGGC GACCGTGCTC CAGAACGAGT TCGAGGAAGT TGGAATCAAC GCCACGGTAA GCGAGATTTC AGACGGTGAG TGGCTGGATT CGCTCGATAC CGGTGACTTC GACATCACCA CGTACGGCTG GGGTGGCAAC GACGACCCGG ACGGCTACTA CTACCACATG TTCCGTGACA CCGCGAACGA CGATGGTGGT ATGAGCGACG ATGTCGTCGG CCACTCCTCG ATCGGCTACC TTTACGAGGG CGCTCGTGAG CGTGGCGACG AGGAGACGCT CGAGGATCTC GAGCGGCTCG ACGAGATCGT TCGTGCAGCG CGGCAGACGA CGGACCGAGA CGAACGCTAC GAGTACTACG TCGAAGCTGT CGACCTCCTC ATGCCGCTCT ATCCGGTTCT CGGTGTGTAC TCCGCCGAGA GCGCGACTGG CGTCCACACG GACGTACAGG ATTACGAGCC GAGCCCGTTC GGCGAGCAGG AAGCGTTCAA CCAGTGGCAA GAGGCGCGGA TCGACGACTA G
|
Protein sequence | MSKEDTTSDI DRRRFIKSTA AIGAAGLFAG CVGSDPDEPG DGSGTFRIAT SDEVETFDPR MNQMAWYSTA AHYLFDSLLM MEPDGSGTVP HLADGELEEV DETTFVADIR DDVTFHNGDQ LTAEDVAYSL NWVRDPDNDS PNLSNVEFLD EAEATGEFEV TLHLEYQSAL MERTLAGMNA AIVPMDVAEE MGQEEFGQNP IGSGPFELAN HDPSASVELT ANEDYFLGEP ALDGLEYRVI PETEVGYVEL STGDIHQSGV TEALLDEAQE DDNIDVYQLD NFDFQGFLVN CLDGPFEDVR AREALQYLVD YDELLVGAVG ELGSRNWGHM PQGVIDAWDF PADEWEEQYY PEQDHDRAVE LFEEAGLGTD FDVEITSQSG EATTGRATVL QNEFEEVGIN ATVSEISDGE WLDSLDTGDF DITTYGWGGN DDPDGYYYHM FRDTANDDGG MSDDVVGHSS IGYLYEGARE RGDEETLEDL ERLDEIVRAA RQTTDRDERY EYYVEAVDLL MPLYPVLGVY SAESATGVHT DVQDYEPSPF GEQEAFNQWQ EARIDD
|
| |