Gene Information |
Locus tag | Nmag_4116 |
Symbol | |
ID | 8828850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013924 |
Strand | + |
Start bp | 160735 |
End bp | 163683 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003482197 |
Protein GI | 289937595 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
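The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon; the record does not state either convention, but both agree with the listed values. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information table for Nmag_4116.
length = gene_length_bp(160735, 163683)
print(length)                     # 2949
print(protein_length_aa(length))  # 982
```

Both results match the Gene Length (2949 bp) and Protein Length (982 aa) rows.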
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.147835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATTCGA CTGCCGGACA GAACACCGAA TTGCAAACGA GAGTCCGGCA ACAGGAAGTT GCCGCCGAAA TCGGTCAGCA AGCACTCGAA ACCAACAATC TCGACCAGTT ATTGCACGAG GCTACGGTTG CTATTGCCGA GACACTTGAT AGTGACTACG CCAAGGTGCT GGAATTGCTC CCTGGTGAGG ACGAAATGAT CTTACGGCAG GGCGTCGGTT GGCGCGATGG TCTCGTCGGC GACGCGACGG TGCCGGCGGC TGTGGACTCG CAAGCGGGCT ACACGCTGCT CTCAGAGGAG CCAATCATCG TAGACGACCT TCGAACCGAT GAACGGTTTT CCGGGCCTGA GTTGCTTACC AGTCACAACG TCGTCAGCGG TATCAGCGTT ATCATCGGCT CTGTCGAGGA TCCATGGGGT GCCCTCGGTG TACACACGGC TGAGTGCCAC GAATTCACTG ACCACGACGT GAACTTCGTC CAGAGCATCG CTAACGTGCT GGCGGCGGTG ATCACGGAAA TACATGCGAA GCAACAGCTC CACGAACGGG AGAACGATCT CAAAGAGACG TTTGACCGCA TCACGGACGG TTTCATTGGG GTGGATGCTG ACTGGGTCAT CACATATACC AACGACCGCG GGAAGGAACT GATTACTCTC GATGACGAGG AACTGGTCGG CCGAAACTTC TGGGATGTGT TCGAACCGGC GCTCGGAACG ACGTTCGAGA AGCACTACCG CGAGGTTGTC ACCACCCAGG AGCCCACCAT ATTTGAGGAG TACTACCCGC CGCTGGATGG CTGGTTCGAG GTCCACGTTT ATCCGTCCGC ATCGGGTCTA TCAATTTACT TCCGCGACAT AACTGAACGC CGGGAGCACG AACGCGACCG GGCACTGTTT CGCACGTTAC TTGATCACTC CAGCGACGCC GTCTTCGTCG AAGATCCTGA GACTGGCGAG ATTCTCGACG TCAACGACAC CGCCATCCGA CAGCTCGGCT ACCCTCGCGA GGAACTACTG GATCTAACGA TAGCCGACAT CGACACCGAG CTCCCCACGC AGGAGGACTA CCGGGAATTC GTGATGGATC TCAAAGCCGA TGGGCACACT ACCTTTAACG GGACGCATCA ACGGAAGGAC GGCTCTACGT TCCCGGTGGA AGTCAACGTC TCGTACATCG AGCTGGATCA GCTAGATCGG GCGTACGTAC TCGCGCTCTC GCGCGATATC ACCGAACGAA CGCAGGCGAA AGAACGCATT CGTGAGAACG AGGCGGCGCT CGAACGACTC AACGTCACAA CCCAGGAACT GATCGAGGCT GATCCCGAGG ACATAAAGCG CCGGACCGCC GACATCGCCC AGTCGGTCCT CAACGTCGAG TACGCCGCGC TTTGGCACTA CGACGAGGCG ATCGGCGAAC TCGACGAATA CGCCAGTCAA ACGAGCCCCG ACATAGACGC TGATGCAGTT CACCTACCAG ACCAGCTTTC CGAGCAGGCC TGGCAGGCAT TCATCAGTGA TGATGTAGAC GTCGGAAACG ACGTCAAAAG TAGTGAAAGC GAGGGCGGAG AGTTCCCACT GCGGAGTCGT GTCTTCGCCC CGTTGGGCAG ACATGGCGTT ATTTGTGTCG GCTCCACTCG AGTCAACACG TTCGACGATA GGTTAGTTGA TCTCGTAGAG ACGGTCGCCA CCACAGTCGA GACCGCGTTG GACCGTGCCG CCGGCAAAGC GGAGTTGGAA CAGCAGAACG AGGAGTTAGT CCGGCTTGAC CGGCTCAACA TACTGATCAG GGAGATTGAT 
CAGGCGCTCG TCCAGGCGGA AACAGTCGAA TCCATCGATG AAGCCGTCTG CGACCGCCTG GCCGAATCTG GGCTGTTCGA GTTCGCTTGG ACCGGCGAGT TCGACGCAGC CGCCGGCACG GTTACCCCTC GGGAGTGGGC GGGGATTGAT ACGGCATCTC TGGAACAGCT CACTACAGGT GAGGAGTCTG CCATTGGTGA GAGCCAGGTC GTCGACGCAG TCCGGTCGGG CGAGGTTCAA GTCGTCGCTG ACACCGCAAC AGATCCACGG GCAACGCCAT GGCGAGAAGC CGCGCTTGAA TCCGGTGGGC GGTCGTGTTT TTGTATCCCG CTCGTGTACG ATGAGTCTAT CTACGGTGTC CTGGTTGTAT ACGGCAGGAC GCCACAGCCC GACGAACGGA ACGTGGACGT CCTCTCGGAA CTCGGACAGA CGATTGCCCA CGCGATTCAC GCGGTCGAAA CCAGAGTGGC TCAGCGAACC GACAGCGTTG TCGAACTCAC GCTTCAAACG ACGGCGGAAA CGCCACTCGT TCGGCTCGCT CTGGAGGCAG ACTGTGTACT CAAGTTTGAG GGAGTAGTGC CGAGAGCCGA CGGGGACGTG ACTGTGTTCT TCACTACGAG AGATGTCTCC CCCGACGAGA TGATTGATGC CGGCGAGCGA TCGCTCGTCA TCAAGAAAGT ATCCCATCTA GCGGAACAGA ACGGCGGTTT CCTGTTCAAA GCACAGCTGG CTAACTCGAC GCTTGCCAGG AGATTCCTCG ACCGGGGCGC GACGATCCGT TCGCTAAACA TCGATGCGGG GACGGCTACT GCCGTCGTGG AGTTACCGGA GACAGCTGAC GTCCGCGAGT TCGTTGCGGG GCTGAAGCAA GACGTGCCCG ACTGTAATTT GCTCGCTCGC CAGTCTCGAA CCAGGTCACC CGATACGGAA CAGCGGCTCC AAACCGCATT CGATCAGCGT CTCACCCCTC GTCAACAAGA GATACTCCAA CTGGCCTACC GGAGTGGGTA CTTCGAGTCC CCCCGCGTCC AAACGGGGAA GGAACTCTCC GACGCGCTGG ACCTCTCGCA ATCGACGTTC AACTATCACC TCAGAGGTGG CGAACGCACA CTCCTGGCGA TGGTGTTTGA TCACGTCCCA GATGCGTAG
|
Protein sequence | MDSTAGQNTE LQTRVRQQEV AAEIGQQALE TNNLDQLLHE ATVAIAETLD SDYAKVLELL PGEDEMILRQ GVGWRDGLVG DATVPAAVDS QAGYTLLSEE PIIVDDLRTD ERFSGPELLT SHNVVSGISV IIGSVEDPWG ALGVHTAECH EFTDHDVNFV QSIANVLAAV ITEIHAKQQL HERENDLKET FDRITDGFIG VDADWVITYT NDRGKELITL DDEELVGRNF WDVFEPALGT TFEKHYREVV TTQEPTIFEE YYPPLDGWFE VHVYPSASGL SIYFRDITER REHERDRALF RTLLDHSSDA VFVEDPETGE ILDVNDTAIR QLGYPREELL DLTIADIDTE LPTQEDYREF VMDLKADGHT TFNGTHQRKD GSTFPVEVNV SYIELDQLDR AYVLALSRDI TERTQAKERI RENEAALERL NVTTQELIEA DPEDIKRRTA DIAQSVLNVE YAALWHYDEA IGELDEYASQ TSPDIDADAV HLPDQLSEQA WQAFISDDVD VGNDVKSSES EGGEFPLRSR VFAPLGRHGV ICVGSTRVNT FDDRLVDLVE TVATTVETAL DRAAGKAELE QQNEELVRLD RLNILIREID QALVQAETVE SIDEAVCDRL AESGLFEFAW TGEFDAAAGT VTPREWAGID TASLEQLTTG EESAIGESQV VDAVRSGEVQ VVADTATDPR ATPWREAALE SGGRSCFCIP LVYDESIYGV LVVYGRTPQP DERNVDVLSE LGQTIAHAIH AVETRVAQRT DSVVELTLQT TAETPLVRLA LEADCVLKFE GVVPRADGDV TVFFTTRDVS PDEMIDAGER SLVIKKVSHL AEQNGGFLFK AQLANSTLAR RFLDRGATIR SLNIDAGTAT AVVELPETAD VREFVAGLKQ DVPDCNLLAR QSRTRSPDTE QRLQTAFDQR LTPRQQEILQ LAYRSGYFES PRVQTGKELS DALDLSQSTF NYHLRGGERT LLAMVFDHVP DA
|
| |
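The Gene sequence field is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below is one way to strip the whitespace and compute a GC percentage that could be compared with the GC content row; the helper names are hypothetical, and the exact rounding the database uses for its 58% figure is not stated in the record.

```python
def clean_sequence(field: str) -> str:
    """Remove spaces and line breaks from a blocked sequence field and uppercase it."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first three blocks of the Gene sequence field are used here for brevity;
# paste the full field value to evaluate the whole gene.
first_blocks = "ATGGATTCGA CTGCCGGACA GAACACCGAA"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_percent(seq), 1))
```

Because the example uses only the first 30 bases, its result is not expected to equal the gene-wide value.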