Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3996 |
Symbol | |
ID | 8828730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | - |
Start bp | 36475 |
End bp | 38166 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003482091 |
Protein GI | 289937489 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAATG GAAACACGCA ACTTGATCGA AGGTCGTTTT TAAAAAGAAC CTCCGCTGCT GCCGGTGCTG CGTCGATAGC AGCAACTGCC GGCTGTCTCG GCGGCGAGAA TGGTGGCGAT AGCGATATCG ACTACGACGA CGAAGAAGAC GCAGGTGAAC CCTTCCCAGA ATATACGTTC TACAACAACC CACAGGATTA CAATCCGCAA CGTCACGACC TGATCAACCT TATTGCTGAA CAGTGGCAGG AAGTTGGCTT CGATGTCGAA GTCGAGGTAC TCGAGTGGGC GACACTCCTC TCTCGAGTCT CCGACGAGTA CGAGTTCGAT TTCGCTGCCT GGTCGCAATA CCAGTCGCCC GATCCTGCGG AGAACGTAGC GGATCGGTGG TCACCAGAGC ATGCAGAGGA GCCTGGACGA GGTAATTACA ACCAGTACCA GAACGATCGG GTTGGTGAAC TCATCGACGA ACAGTTGGCG GCTGAAGAGA TGGAGGGACG GGTAGATGCC TTCCACGAGA TCCAGGATAT CCTCGCAGAA GATGCTCCGT TCAGTCCGGT CTGCTACGAA ACACAGCTTA TTCCCTACCG AACCGACGAA CTTGATGGGT GGGTCGAGCA TCCAGCTGGT CCGGACCGAA TTCATCAGTA TGCCAACGTC GAACCGATGG ATCAGAACGA AGACGGATAT CTCCGTGGAT TCTGGACTGA GGCACTCGAG AACCTGAACT CGTGGTCTCA CGAAGGCCTG AGCAAACATC TCCACATACA GGACGCAATA CAACTTCGGG TAGCCCAAGT TGACGCGGAC CTCGAACTCG ATCCTGAACA CGGACTCGCG CAGGATATAG AGCGACCGGA CGAGACGACG ATTCGATTCG AGATTCGTGA CGATGTCGAG TGGAGCGACG GCGAACCGCT TACTCCCGAC GACGTCGCAT TCACCTACAA TACGATCGCT GAACAGGAGC CATCGACATA CACCACTGTT TCCAACTACG TCGAAGGGGC GTCGGTTGAT GGCGACTGGG TCGAAGTCGA CCTCTCACAG GAGATTGGCA GAGCCGCACT GTTGCTCATT GGGTATGAAG TGTACGTTGC TGCCGAGCAC GTCTGGGAAG GCACTGATCC CGTTCAAGAC GAACTCGTTG AAGAACCAGT CTGCAGTGGT CCGATGGAGG TGGACTACTG GGATGTCGGT CAGGGGATCG AGTTAGAGAC GAGAGACGAT CACCCGATCG ATCTCGCAGT TGATGGGTTG TTCTGGGAGA TCATTCCTGA GAGTTCGACG ATCTGGTCGT ACACCGAAGA TGGGACGATC AACTACCATC CGTTTGCACA GCCCGGACTC GAGCTACAGG ACGGAGAGGA GGAAATAGAC GACTTCGAAG TATTCGAGGC CCCTGGGGAT GGATGGACAC ACCTCAATAT CAATACGACG AGCGAAGGGT TGGACGAAGT CGCTGTTCGA CAAGCACTCG TTCACGCTCT CCCCAAAGAG GCGGCTAGCG AACAGTTGTT CTACGGCTAC ATGCCTGTCG CACACTCCTA TGTCGCGCCC GCATTCGGAC CGCTTCACCG CGAACACGAA GACCTCCCGT TCACTGGCGA AGGGACCATT GCAGCGGCGG CAAGCCATCT GCAAGAGAAC GGGTACGTGG TAACCGAGGA CGGCGTCTAC TATTCAGAGT AG
|
Protein sequence | MGNGNTQLDR RSFLKRTSAA AGAASIAATA GCLGGENGGD SDIDYDDEED AGEPFPEYTF YNNPQDYNPQ RHDLINLIAE QWQEVGFDVE VEVLEWATLL SRVSDEYEFD FAAWSQYQSP DPAENVADRW SPEHAEEPGR GNYNQYQNDR VGELIDEQLA AEEMEGRVDA FHEIQDILAE DAPFSPVCYE TQLIPYRTDE LDGWVEHPAG PDRIHQYANV EPMDQNEDGY LRGFWTEALE NLNSWSHEGL SKHLHIQDAI QLRVAQVDAD LELDPEHGLA QDIERPDETT IRFEIRDDVE WSDGEPLTPD DVAFTYNTIA EQEPSTYTTV SNYVEGASVD GDWVEVDLSQ EIGRAALLLI GYEVYVAAEH VWEGTDPVQD ELVEEPVCSG PMEVDYWDVG QGIELETRDD HPIDLAVDGL FWEIIPESST IWSYTEDGTI NYHPFAQPGL ELQDGEEEID DFEVFEAPGD GWTHLNINTT SEGLDEVAVR QALVHALPKE AASEQLFYGY MPVAHSYVAP AFGPLHREHE DLPFTGEGTI AAAASHLQEN GYVVTEDGVY YSE
|
| |