Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_2234 |
Symbol | |
ID | 8825084 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 2290533 |
End bp | 2291792 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003480361 |
Protein GI | 289581895 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGACG ATACCATGAA CGTCGACCGT CGCGGTGTGC TCCGTGGCGT CGGCGGCGGG GCGATCGTAC TGGCGGGACT GGGCGGCGCT GGCAGTGTCA GCGCACAGGA TGCAATCACA GTCACGGCGG TCTGGACCGA CGACGAGGAA GAAGACTTCC TCGCTGTGGT CGACTACGTC GAGGACGAGA CCGACATCGA CATCTCGTAT GCGCCACGAG ACACCGAAAC GCTCCTGACA GAGACGCTGA TGGACTACGA GGCCGGCATC GCGACGGCGG ATATCGTCGT GTTGCCGACC GAGGGACGTG TCCGACGGGA CGGCGAAGCG GGGCACCTCG AACCGGTTGG TGACCTGTGG GACGAAGACG AGTACTCGAC CGAGCACGCG GTGGTCGAGG CGAACGGCGA GGTCTACGCC GCTCCGTTCG GGATGGACCT CAAGCCGGGC TTCTGGTACC GCCAGTCGTT CTTCGATGAA CACGGACTCG AGGAGCCCGA GGATTACGAC GCCTTCCTCG ACCTCCTCGA CGAGATCGAC GGCATCGAAG GCGTCGAGGC ACCGCTCGCG TCCGGGAACG GCGACGGCTG GCCGCTCAGC GACGTCACGG AGGCGTTTAT CCTTCGGCAG GACGACGGCG CACAGCTCCA GCAGGACCTC ATCGAGGGCG ATGCCGAGTT CACCGACGAC CGCGTCGTCA CGGCCTTCGA GGAACTACAG GAACTCTTGC AGGCGGGCTA CTTCAGCGAG GTCCGTGATT TCGGTGTGCA GTACGAGTTC TTCTGGGAGA ACGAGACGCC CCTGTACTTC ATGGGGTCGT GGACACCAGC CTTCGGCGCA ATCGAGGATC CAGACGACCT CGAGTACTTC ATGCTCCCGG GTACCGATGC GATGGTGACC AGCATCAACT GGTTCACCAT CCCCGCGTAC ACGGAGGCGA CTGACGCGGC CAGAACCGCC GTCGAGGAAA TCATCTCTCC CGACGGTCAG GAAGTCTGGA CCGAACGCGG CGGTTTCGTT CCGTCATCGC TCGAGGTGCC GGCAGACGCG TTCGACCACG ACATCATGCA GGAACTGTCC GAACACGCTG ACGAGGTCGA ACTCGTCCCC GACCTCGACG ACGCGGTCGG CGATCCGTTC CAGGCCGAGT TCTGGTCGCA ACTGCTCGGT CTCTGGGCCG AACCAGACCA GGACGTCACC GGCATCACCG AGTCGCTCGA CGGCGTCTTG CAAGAAACCG TTCAGGAGGA CGACCCATAG
|
Protein sequence | MGDDTMNVDR RGVLRGVGGG AIVLAGLGGA GSVSAQDAIT VTAVWTDDEE EDFLAVVDYV EDETDIDISY APRDTETLLT ETLMDYEAGI ATADIVVLPT EGRVRRDGEA GHLEPVGDLW DEDEYSTEHA VVEANGEVYA APFGMDLKPG FWYRQSFFDE HGLEEPEDYD AFLDLLDEID GIEGVEAPLA SGNGDGWPLS DVTEAFILRQ DDGAQLQQDL IEGDAEFTDD RVVTAFEELQ ELLQAGYFSE VRDFGVQYEF FWENETPLYF MGSWTPAFGA IEDPDDLEYF MLPGTDAMVT SINWFTIPAY TEATDAARTA VEEIISPDGQ EVWTERGGFV PSSLEVPADA FDHDIMQELS EHADEVELVP DLDDAVGDPF QAEFWSQLLG LWAEPDQDVT GITESLDGVL QETVQEDDP
|
| |