Gene Information |
Locus tag | Nmag_3165 |
Symbol | |
ID | 8826026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | - |
Start bp | 3278673 |
End bp | 3280334 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | putative sodium symporter protein |
Protein accession | YP_003481279 |
Protein GI | 289582813 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGAGC TACTCGAGCC GTTCGTCCTC CAGGAGACGG CGCTCGACGT CGGCGAGTTC AAACTCGTGC CGGCGCTGAC CGTCGCGGCG ATGCTGGCGC TGTTCCTCGG CGTCGGCTAC TTCTTTCGGG TCGCGGCCGT CGACGATCTG TGGGTTGCCG GCCGATCGAT CGGTGCAGTT GAGAACGGGA TGGCGATCGG TGCAAACTGG ATGAGTGCGG CATCCTACCT CGGTGTCGCG TCGATTATCG CCCTCTCGGG CTACTTCGGA CTGGCGTACG TCGTCGGCTG GACGACGGGC TACTTCATCC TGCTCATCTT CCTGGCAGCC CAGTTCCGTC GGTTCGGGAA GTACACGGCA CCTGACTTCG TCGGCGACCG GTTCTACTCC GACTGGGCAC GCGGTATCGC GGCGTTCACC ACACTTGCGA TCGCGTTCAC CTACGCCATC GGGCAGGCCA GCGGCATGGG GTTGATGGCC CAGTACATCT TCGGAATCTC CTACGAGATG GGTGTGATCG TCCTGATGGG AGTCACTATC GGCTACGTCG CCCTCTCGGG GATGCTGGGG ACGACGAAGA ACATGGCGAT CCAGTACGTG ATTCTCATCA TTGCCTTCAC CATCGGCCTC TACGCCGTCG GCTGGAGCCA GGGCTGGTCG ACCGTTCTGC CGTACTTCGA ATTCGGCAGC GAAGTCGCGG CGGCGGCTGA GATCGAAGCG CAGTTCGTTG AGCCGTTCGC CGACGGTTCC TACTACGCCT GGATCGCGCT AGCGTTCAGT CTGATCGTCG GCACCTGCGG TCTGCCGCAC GTTCTCGTCC GGTTCTACAC CGTCGAGAAC GAGCGAACGG CCCGCTGGTC GACCGTCTGG GGCCTGTTCT TCATCTGCCT GCTGTACTGG GGAACGGCCA CCTACGCCGC GTGGGGCGGA TTGCTCTACG ACAGCGAGGT AACCGGCGGC GGTGGCTTCG CCGACATGCT CCCCATCGAA GCCGACGCGC TAGTCGTTCT GACTGCACAA CTCGCTGAAC TGCCGACGTG GCTCGTCGGA CTGGTGGCTG CCGGTGCCGT CGCCGCAGCG CTTGCGACCA CCGCGGGGCT GTTCATCTCG GCTTCCTCGG CCGCTGCACA CGACATCTAC ACGAATCTCT ACAAGGAGAA CGCAACCCAG CGCGAGCAGA TGCTCGTGGG CCGCGTGACG ATCCTGGGAA TCGGTATCCT GGTGACGCTC ATCGGTCTCA ACCCGCCTGC GCTCATTGGC GAACTCGTCG CGATGTCGTT CGCCATCGCG GGCACCGTCT TCTTCCCGGT GTTCTTCCTC GGGCTTTGGT GGGAAAACAC CACGAAAGAG GGCGCACTCG CGGGAATGCT CACCGGCATC CTGCTCTCGT TCGGTGCGAT CCTCAACGAC ACGGCGATCC CGATGTACAC CGGCGTCGAA GATGCGGTAA TTCCCGCGCT CGCGACCTGG CTGCCGGGGA CCTCCTCGGC ACTGGTCGGC GTGCCGGTCG TCTTCGGGGT CATCATCGTC GTTTCGATCG TGACGGACAA CCCACCACAG CACGTCAAAC GACTCGTCCG GCAGTGTCAC AGCCCGGAAC CGATGAGTCA GACGGAGTCG GCGGAAGACG CTGCGAACGG TGGGGCACCG GCTGACGACT GA
|
Protein sequence | MIELLEPFVL QETALDVGEF KLVPALTVAA MLALFLGVGY FFRVAAVDDL WVAGRSIGAV ENGMAIGANW MSAASYLGVA SIIALSGYFG LAYVVGWTTG YFILLIFLAA QFRRFGKYTA PDFVGDRFYS DWARGIAAFT TLAIAFTYAI GQASGMGLMA QYIFGISYEM GVIVLMGVTI GYVALSGMLG TTKNMAIQYV ILIIAFTIGL YAVGWSQGWS TVLPYFEFGS EVAAAAEIEA QFVEPFADGS YYAWIALAFS LIVGTCGLPH VLVRFYTVEN ERTARWSTVW GLFFICLLYW GTATYAAWGG LLYDSEVTGG GGFADMLPIE ADALVVLTAQ LAELPTWLVG LVAAGAVAAA LATTAGLFIS ASSAAAHDIY TNLYKENATQ REQMLVGRVT ILGIGILVTL IGLNPPALIG ELVAMSFAIA GTVFFPVFFL GLWWENTTKE GALAGMLTGI LLSFGAILND TAIPMYTGVE DAVIPALATW LPGTSSALVG VPVVFGVIIV VSIVTDNPPQ HVKRLVRQCH SPEPMSQTES AEDAANGGAP ADD
|
| |
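
The Start bp, End bp, Gene Length, and Protein Length fields under Gene Information can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag Nmag_3165.
start_bp, end_bp = 3278673, 3280334

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1662
print(protein_length(length_bp))  # 553
```

Both results agree with the record: 1662 bp is 554 codons, of which the last (TGA) is the stop codon, leaving 553 amino acids.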