Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_0096 |
Symbol | |
ID | 8822915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 117996 |
End bp | 119192 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003478257 |
Protein GI | 289579791 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.265254 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAATC GACGCGACAT CATCAAAGGT GCGGGAGCAG CTAGCATAGC GGGTCTGGCC GGGTGTCTCG GTGGAGACAA CGGCGGCAGG GACATCAGAC CGGTCGAGAT CGATTTCGAC GACTGGCCGC CAGAGGAGTA CGGCGGCAAT CTTAACGCCT GGAACTGGTA CGTCGAGTGG AACGAGTGGG GAGCCGAGGA TTTCGCCGAG GAGTACGACC TCGACAGCTA CTCCACAGAG GCGTACTCGA CGCCAACCGA CTGGTTCAGC AATCTACAAG CCAGCCCAGA GAATCACGGG ATCGATCACA TCGGTGCCTT CACGGAGTGG GTTCACCGCG CCCGTGAGGA AGAGATGATC GAACCGATAC CGATCGACGA GTTACCCAAC GTCGAGGTCG CTGATCAGTA CCTCGATCCA CACCGTGAAC TGTTCTGGAG CGACGACGGC GTCGGTGGCG TCTACGGACT ACCCCACTCA GTTGTGATCA GCCCGATCGT GATGTACAAC ACCGAGGAGG TCGAGGACCC GCCCGAGTCG CTCGACATCC TCTGGGATGA GGAGTACGCG GATGAGATTT CGATGATGGC ACACCACGGC GGGTTCCTCT GCGATGTCGG AGCGCTGTAC ACCGGCCAGG ATCCGAACGA TCCGGACGAC TTCGAAGAGA TCCAGGAGGT ACTCGAGCAA CAGCGCGACC TCGTCTTCAA CTACGCCGAC GAACACGAGA CACAGATGCA ACTCGTGATG AGCGGTGACG CTGCCCTTGG AACGCACACA GACGGGCGAG CGTTCAGGGC GATGTACAAC CAGGGCGGCG ACGTCGACTG GTTCATCCCG GAAGAAGGCG CGACCTGGGG GACAGACGTG ATCCTGCTGC CACAAAACGC TCCGAACCCG GTAACGGCGA CGATGTACAT CGACCACCTG TTCACGGATA CTGGCTGGGA GAAGTTTGTC GAAACGACGG TGTACCGACC GCCGTTCGAA AACGAAGAGT TCACCGACGG GGAACTCGGC GACGCCATTC GCGAAAAGTG GGACGATGAG TGGGACAAAC ACGGCGAGGC GGAAGATTTC ATCGACGACC TGGTTATCAC CGACGAAGAG TTCGACCGGA TGCACCACAA CTGGCCCCGC TCGGACGACG TCATCGAACG GTACGACGAG ATCTGGACCG AAGTCACCGC CGGATAG
|
Protein sequence | MVNRRDIIKG AGAASIAGLA GCLGGDNGGR DIRPVEIDFD DWPPEEYGGN LNAWNWYVEW NEWGAEDFAE EYDLDSYSTE AYSTPTDWFS NLQASPENHG IDHIGAFTEW VHRAREEEMI EPIPIDELPN VEVADQYLDP HRELFWSDDG VGGVYGLPHS VVISPIVMYN TEEVEDPPES LDILWDEEYA DEISMMAHHG GFLCDVGALY TGQDPNDPDD FEEIQEVLEQ QRDLVFNYAD EHETQMQLVM SGDAALGTHT DGRAFRAMYN QGGDVDWFIP EEGATWGTDV ILLPQNAPNP VTATMYIDHL FTDTGWEKFV ETTVYRPPFE NEEFTDGELG DAIREKWDDE WDKHGEAEDF IDDLVITDEE FDRMHHNWPR SDDVIERYDE IWTEVTAG
|
| |