Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_3851 |
Symbol | |
ID | 8826721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013923 |
Strand | + |
Start bp | 242694 |
End bp | 243755 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003481954 |
Protein GI | 289583544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0852756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAATTG GTAAGCAATC ACGTAGGCGG TTTCTCACCG CAACGGGTGG TGCTGCGGCA CTGGGAACAG TTGCAGGGTG TCTTGGTGGT GACGACGACG ACGTCGTGAA CTACTTCAGC TGGGGGGCGT ACATCGACGA CAGCTGGATT CAGCCGTTCG AGGAGGATAC AGGGATCACG GTTAACACCG AGACTTACGA GTCGAACGCC GACGCGATCA ACCAGATCGA GACCTCTCCA GAGGGAACGT ACGACGTCTG GACGCCGTCC GCGCCAGGCG CAGACTTCGA GCGTGCATAT CGGAACGACC TCCTCGATCC GATCGACCTG GATAACGTCC CCGGCTGGGA CGAGTACATC TTCGAGGAGA TGAAGCTCGA CGACTTCTGG TTCGACGACG AACTCTACGC CGTCCCGATG ACGTTCGGGT TCGACGGCGC AATCTACAAC CACGAGGAGG TCGGCGATCT CGGCGACGAG ATCAGCTACG ACGTGCTCTG GGACGACGAG TACGCCGGCG AGATCACGAG CCGAGACGAC GCGGCGACGC AGATTTGGAC CGCAGCGAAG TACCTCGACC AGGATCCGGA CGAGCCCGAG GATCTGGAGG CGGTCGAGGA CGCGCTGAAC GAACACGTCG ACCTGGTGAA CACCTACTGG ACCTCCTCTG CGGAGTCGAT CCAGATCTTC CAGCAGGGCG AGGCGTCGAT TGGGACATCG TGGGACGGTG CCTACCACCG CCTCGCGGCG GAGGACGAAC CGGTCTCGCT GGCCTTCTGG GAGGAAGGAA CTATCGGCTG GATCGACTCG TTCTGTATCG CCCGCGGCTC GGAGAACAAA GAAGAAGCCG AGCAGTTCAT CGACTACATG GTCTCCGAAG TCCCCCGGGC GTGGTTCGAG GGGCCGGAGT ACATTGTCGT CTCCGACGCG GTCGATTACA CCGACGAGGA ACTCGACCAG TACAACCTCG AGATGGCACT CGAGAACACC GAGTTCCCGG CGTACAACGA CGACGATCAG ATCCAGACCT ACGACGAGAT TTGGACGGAC GTCACCGTCT AA
|
Protein sequence | MPIGKQSRRR FLTATGGAAA LGTVAGCLGG DDDDVVNYFS WGAYIDDSWI QPFEEDTGIT VNTETYESNA DAINQIETSP EGTYDVWTPS APGADFERAY RNDLLDPIDL DNVPGWDEYI FEEMKLDDFW FDDELYAVPM TFGFDGAIYN HEEVGDLGDE ISYDVLWDDE YAGEITSRDD AATQIWTAAK YLDQDPDEPE DLEAVEDALN EHVDLVNTYW TSSAESIQIF QQGEASIGTS WDGAYHRLAA EDEPVSLAFW EEGTIGWIDS FCIARGSENK EEAEQFIDYM VSEVPRAWFE GPEYIVVSDA VDYTDEELDQ YNLEMALENT EFPAYNDDDQ IQTYDEIWTD VTV
|
| |