Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_4050 |
Symbol | |
ID | 8828784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | - |
Start bp | 91861 |
End bp | 93507 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003482140 |
Protein GI | 289937538 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCCG ATGGCAATAT GCAGTTTAAC CGAAGAGAGC TAATTAGCGC GCTCAGTGCA GGGGGTGTAC TCGCGCTCGC AGGCTGTGCG GACCAGGCCG ATGGCGATGG GAACGACGAA ATTTTCGTCG ACGCACTCGA CTCCGATCCG GGAACGCTCG ACCTACACGA AACAAACCGC GTGCCGGAGA GTATGTGTCT GACGCCGGTC CACGAGCGAT TGTTTACGAT CGATCCGGAT CTCGAACCGC AGCCGTGGCT GGCAACGGAG TACGAAACGA ACGACGACGA GACCGAGTAC GTAATCCAAC TCGAGGAGGG CGTCGAGTTT CACGACGGAA CTGAGTTCAA TGCAGACGTC GCCAAGTGGA ACCTCGAGCG GGCCGAAGAG AACTCGCCGG ACGCCTGGCA GTTCGGGACG CTCGAAGAGA TCGATGCAAC CGGAGACTAC GAGTTGACGT TCCATTTCGA GGAGCCACAT CCGCTGTTCC CACAATACCT GGCAAACGTC CAGATGGGAT TCGCCTCACG GGAGGCAGTT GAAGCAGCAG GAGACGACTA CGGACAGGAA GAAGTTGTCG GGACAGGACC GCTCGTGTTC GAAGAGTGGG TGCGTGACGA TGAAATCGTC TATTCACGCA ACGAAGACTA CGACCGAGGG CCGGATTTCC TCAGCAACGA TGGCCCGATC AACTTCGAGG AGTACCACGT CAGAATCGTC CCGGAACCAA CGACGCTGCT CAACGAGGTT ACTGTCGGCA ACGTCGACGG AAGCATGATG ATCGCAGCGA GCGATGCCGA AGATGTAGAG GCGCACGATA ACACGCAACT CGAGCGTGTC GACGACGCGC ACCCGACCTT CCTGTCAATC AACGTCGAAG CGGAGCCAAC CGACGAGGTA GAAGTGAGAC AGGCGATGGC ATACGCTGTC GATCAAGAGG CAATAGTCAA CGCTGCATTC CACGGCGAAG GCTATCCAAT CTACAGTCTG TGTCCCCCAA TGGCTGTCGG TGGACTGGAC GAAGCAACCG CACGAGAGAC AGGGTACGAG CAAGACCTCG ACACTGCCCG TGAACTTCTC GACGACGCAG GATGGGAGAA CGACGAAGAA GGAGAAGTCC GGACAAGAAA CGGCGACGAT CTCTCGGTTT CGTTCTTCGC CTTCGAGATG GAGCCCTACT CGAGTATCGG AGAAGTCACA CAGGATATGC TCAGCCAGGT CGGCTTCGAA GCCAATCTAG AGATTCTCGA GTCGGGGACA CTGTACGACA GAGTGGAGGG CGGCGAGCAC AATCTGGTGA CGATGGCACT GAGTGGAGGA TACATTGCCA ACAACACGCT TGCATCGACC CTTCACAGCC AAAACTATGC GCCCGACGGT GGGAGCAATT ACTCGCTGTA CCAGAGCGAC GAGTATGACG AGATCATCGA TCAGGCGGAG GTCGAACCCG ACGATGCCGA GCGAGAGGCG TTGCTTCACG AGGCACAGGA GCACATCCTC GAGGAGGTCC CCGTCGTGCC ACTTGTAGGC TTCGTCAAGT TCTACGCGGC CAAAAACGAG ATCAGCGTGG ATGCCTGGAC CGATCACCCA TGGTGGCCAT CTCCTGATCA GTACAACCTC CATGCGGTGG ATGTTGATCG GAGCTAA
|
Protein sequence | MEPDGNMQFN RRELISALSA GGVLALAGCA DQADGDGNDE IFVDALDSDP GTLDLHETNR VPESMCLTPV HERLFTIDPD LEPQPWLATE YETNDDETEY VIQLEEGVEF HDGTEFNADV AKWNLERAEE NSPDAWQFGT LEEIDATGDY ELTFHFEEPH PLFPQYLANV QMGFASREAV EAAGDDYGQE EVVGTGPLVF EEWVRDDEIV YSRNEDYDRG PDFLSNDGPI NFEEYHVRIV PEPTTLLNEV TVGNVDGSMM IAASDAEDVE AHDNTQLERV DDAHPTFLSI NVEAEPTDEV EVRQAMAYAV DQEAIVNAAF HGEGYPIYSL CPPMAVGGLD EATARETGYE QDLDTARELL DDAGWENDEE GEVRTRNGDD LSVSFFAFEM EPYSSIGEVT QDMLSQVGFE ANLEILESGT LYDRVEGGEH NLVTMALSGG YIANNTLAST LHSQNYAPDG GSNYSLYQSD EYDEIIDQAE VEPDDAEREA LLHEAQEHIL EEVPVVPLVG FVKFYAAKNE ISVDAWTDHP WWPSPDQYNL HAVDVDRS
|
| |