Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_0005 |
Symbol | |
ID | 8822819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 6750 |
End bp | 8549 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003478167 |
Protein GI | 289579701 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.369822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGG TCGCTCCGTC GGTGACACGA CGGTCCCTCC TTGCCGGCGC AAGCGGTGTC GGGTTGAGCG CACTCGCCGG GTGCTCGGAG CGATTCTGGT CGCGAGCGGA GAACACGGGC CCTGATCAGG TCGAACTCAC GATTAAGACG GTCCCTGCCG ACGACGACGC GATCGCGGCG ACGATCATGA GTCAACTTCG CGAAAATTTC CAGGCCGCTG GCATCGATGC GACCCACGAG CCGATCGCCG AAGCGGAACT GTACCGTGAC GTTCTCATCG ACGGCGATTA CGACGTCTTC GTCCTCAAAC ATGCCGGACT CGACGAGTAC GACGCCCTCC ATGGACTGCT TCACTCGCAG TTTGTCACCG AACGGGGCTG GCAGAATCCG TTCCAGTTCT CCGATATTCA CGCAGACGAG TTGCTCGACG AGCAACGGGC AACTGACAGG GCAGAGCGCG AAGAGACACT GACTGAACTG TTCAACTATC TGTTAGAGAC GGTGCCGTTC ACCGTCGTCG CCTACCCCGA CCGCCTCGGC GGTACTCGCG AGACACTCTC GGTCTCACGA CCACCGCGGC GCGCAATCGA AATCATCGAT ATTCTCTCAC AGGAGTTCGA GGATGGCCCC TTCGACCGGC CGCTCGAACT CGGCATCTAC GGTGAGGCGC TGACGGACCG ATTGAATCCT CTCATCGTCG ACCGTAACCG AGTGCAGGGG TTGATGGAAT TACTGTACGA CCCGCTCGCC CGCCAGTCAC TAACTGACGA GTCAGAGCCG GGCGAGCCAG CGACGTACAG CCCGTGGCTC GCCGAAGAAA TCGAGTGGGA CGAGGAGAGT TCGCTCACCG CGACGGTGAC GCTTCGTTCT GACCTTCACT GGCACGATGG CGAACCGCTC GATGCCGACG ACGTCGATTT CACGTTTCGG TTCCTCGGGG ATACGTCCCT CGGAGCCGTC GATGGAACGA TCCCGGCACC CCGGTATCGG GACAGACAGA CGCTTATCGA CGACAGTGAT CGGATCGAGG TGCTCGACGA TCGAACAGTT GTGCTCCCGT TCGGCTCTAC CGCTCGCCCG GCTGCGACGC GTATCCTCTC GGTCCCGATC CTGCCCCAGC ACATCTGGGA GCCACTATCC GAGGTCATCG CCGAGCGCCA AACCAGGGCA CTCGTCCACG ACAACGAGGA ACCAGTCGGC TCCGGTCTCT TCGAACTCGT CGAAACATCG GCGGACGAAC TCGTCCTCGA GCCGTTCGAA GATCACGTGC TTCGCTCGTC TACTGTTCCA AATCGCCCCA GTATCCTGGA ACACTTCTCG CAGTTCGAGG GCATTCGATA CCGAATTGAT CCGAACGTCG GTGCCATGCT CGACGCGCTC GAAGACGGAG AGATCGATGT CACGGCCGGT GCAGTCCCTC CCGACTCGGC CGCACAACTC ACCGATGCAG ACGATGTCTC GATGGTCACC ACCTCGACGA GTTCGTTCTA CATGGTCGGG TACAACATCC ACCACCCCGA ACTCGGCAAT CCAAACTTCC GACAGATCGT CTCACAGTTA ATCGACCGCG AGTATGTCGT CTCTGAGTTC TTCAACGGCT TTGCATCAGA ACCCACACGG AGCACTGGAC TCTTTGGCTA TCAGCAGTAC AGTCAGGACG ACGCTACCCT CGAGGGCACT ACATCGACCA ACCCCATCTC GGTATTCCCG GGATCTGACG GCGAAATCGA TCCCGAACGC GTCAGGCAAC TGTTCACCGA GGCCGGCTAC CAGTACGACG ATGACGTGTT ACTCGAGTAA
|
Protein sequence | MKRVAPSVTR RSLLAGASGV GLSALAGCSE RFWSRAENTG PDQVELTIKT VPADDDAIAA TIMSQLRENF QAAGIDATHE PIAEAELYRD VLIDGDYDVF VLKHAGLDEY DALHGLLHSQ FVTERGWQNP FQFSDIHADE LLDEQRATDR AEREETLTEL FNYLLETVPF TVVAYPDRLG GTRETLSVSR PPRRAIEIID ILSQEFEDGP FDRPLELGIY GEALTDRLNP LIVDRNRVQG LMELLYDPLA RQSLTDESEP GEPATYSPWL AEEIEWDEES SLTATVTLRS DLHWHDGEPL DADDVDFTFR FLGDTSLGAV DGTIPAPRYR DRQTLIDDSD RIEVLDDRTV VLPFGSTARP AATRILSVPI LPQHIWEPLS EVIAERQTRA LVHDNEEPVG SGLFELVETS ADELVLEPFE DHVLRSSTVP NRPSILEHFS QFEGIRYRID PNVGAMLDAL EDGEIDVTAG AVPPDSAAQL TDADDVSMVT TSTSSFYMVG YNIHHPELGN PNFRQIVSQL IDREYVVSEF FNGFASEPTR STGLFGYQQY SQDDATLEGT TSTNPISVFP GSDGEIDPER VRQLFTEAGY QYDDDVLLE
|
| |