Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3647 |
Symbol | |
ID | 8826515 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013923 |
Strand | + |
Start bp | 30363 |
End bp | 32033 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003481757 |
Protein GI | 289583347 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0862095 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCAATG AAGACACGAC ATCCGACATC GATCGACGCC GATTCATCAA ATCGACGGCT GCTATCGGCG CAGCAGGGCT CTTTGCCGGG TGCGTTGGAA CCGATCCAGA TGAAACCGGT GATGGGGGTG GGACGTTCCG GATCGCGACA CCAGACGAGG TCGAGACGTT CGATCCGCGA ATGAATCAGA TGGTGTGGTA CAGTACGGCT GCACACTACC TGTTCGACTC GCTTTTCATG TTGGAACCGG ACGGCAGTGG CGCGGTCCCA CACCTTGCCG ACGGCGAACT CGAGGAGGTC GACGAAACGA CGTTCGTCGT CGACATCCGC GATGACGTCA CCTTCCACAA CGGTGACCAG CTGACCGCGG AGGACGTGGC GTACTCGCTC AACTGGGTCC GTGACCCGGA CAACGACTCG CCCAACCTCT CGAACGTCGA GTTCATCGAC GAAGCCGAGG CGACTGGTGA GTTCGAGGTC ACACTCCACC TCGACTTCCA GTTCGCGCTG ATGGAACGGG AGTTATCCTC GATGAACGCC GCGATCGTTC CAATGGATGC CGCCGAGGAG ATGGGACAGG AGGAGTTCGC TCAGAACCCA ATCGGCAGTG GGCCGTTCGA ACTCGCGAAC CACGATCCGT CGGCCAGCGT CGAACTCACG GCCAACGACG ACTACTTCCT CGGCGAACCC ACCCTCGGCG GGGTCGAGTA CCGGATCATT CCAGAGGCAG AGGTTGGGTT CGTCGAACTG GCGGACGGTA CTATTCACCA GTCGAGTGTG ACGGAAGCCC TCGTCGACGA AGCTGAATCG AACGACAACG TCAACACGTA CCAGATCAGT GATTTCAACT TCCAGGGGTT CATCATCAAT TGTCTGGAAG GCCCGTTCGT GGACACCAGG GCTCGCGAAG CCGTTCAGTA TCTCGTCGAC TATGACGAAC TGCTGACTGG TGCGGTCGGT GACCTCGGTA GTCGAAGTGT TGCCCACATG CCCCCGGGTG TCGCCGAAGC ATGGGATTTC CCAGCCGACG AGTGGCGAGA ACAGTACTAC CCGGAGCAAG ACCACGACCG TGCCGTCGAA CTGTTCGAGG AGGCTGGACT CGGCACCGAC TTTGAAGTTG ACATTGTGAC CATGTCTGGC GAGTCGACGA CGGGACGATC CACTGTCTTA CAGCACGAGT TCCAGGAAGT CGGAATCGAT GCATCGGTCA GAGAGGTCTC AGACGGCGAA TGGCTGGATG CACTCGATAC CGGGGACTAC GACATCAACA CCTATGGCTG GGGCGGCGGC GACGACCCGG ATGGCTACTA CTACCGGATG TTCCGTGACC TCGCAAACGA CGACGGTGGC ATGAGCGACG ATGTCGTCGG CCATTCGTCG ATCGGCTACC TCTACGAGGG CGCTCGAGAT CGCGGCGACG ACGAATTACT CGACGAGCTC GAACGACTGG ACGAACTCGT TCGTGCAGCG CGAGAGACGA CGGATCGAGA CGAACGCTAC GAGTACTACG TCGAGGCAGT CGACCTCCTC ATGCCACTGC ATCCGGTCAT CGGCGTCTAC TCCGCCGAGG GTATAACTGG CGTCCACACG GACGTACAGG ACTACGAGCC GAGTCCGTTC GGCGAGCAGG AGGCGTTCAA CCAGTGGCAA GAGGCGCGGA TCGACGACTA G
|
Protein sequence | MFNEDTTSDI DRRRFIKSTA AIGAAGLFAG CVGTDPDETG DGGGTFRIAT PDEVETFDPR MNQMVWYSTA AHYLFDSLFM LEPDGSGAVP HLADGELEEV DETTFVVDIR DDVTFHNGDQ LTAEDVAYSL NWVRDPDNDS PNLSNVEFID EAEATGEFEV TLHLDFQFAL MERELSSMNA AIVPMDAAEE MGQEEFAQNP IGSGPFELAN HDPSASVELT ANDDYFLGEP TLGGVEYRII PEAEVGFVEL ADGTIHQSSV TEALVDEAES NDNVNTYQIS DFNFQGFIIN CLEGPFVDTR AREAVQYLVD YDELLTGAVG DLGSRSVAHM PPGVAEAWDF PADEWREQYY PEQDHDRAVE LFEEAGLGTD FEVDIVTMSG ESTTGRSTVL QHEFQEVGID ASVREVSDGE WLDALDTGDY DINTYGWGGG DDPDGYYYRM FRDLANDDGG MSDDVVGHSS IGYLYEGARD RGDDELLDEL ERLDELVRAA RETTDRDERY EYYVEAVDLL MPLHPVIGVY SAEGITGVHT DVQDYEPSPF GEQEAFNQWQ EARIDD
|
| |