Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmag_2152 |
Symbol | |
ID | 8824997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 2199966 |
End bp | 2202896 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | |
Product | DNA-directed RNA polymerase subunit A' |
Protein accession | YP_003480283 |
Protein GI | 289581817 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.511073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAACA GTACACCAAA AGACATCGGA TCGATCAACT TCGGGCTCAT GGAGCCCGAA GAGTACCGGG AGATGAGCGC GACGAAGATC ATCACCGCCG ACACCTACGA CGACGACGGG TTCCCCATCG ACATGGGACT GATGGATCCG CGTCTCGGCG TGATCGACCC CGGACTCGAG TGCAAGACCT GTGGGAAGCA TTCGGGTTCC TGTAACGGTC ACTTCGGTCA CATCGAACTC GCCGCGCCTG TCATCCACGT CGGCTTTACG AAGCTCATCC GCCGGCTGCT GCGTGGCACC TGCCGTGAGT GTTCGCGCCT GCTGCTGGTC GAGGACGAGC GCGAGGAGTT CCAGGACCAG ATCGAGGAGT CTCGAAAGCT CGGCCGTGAC CTGAACGACG TGACGAAGGC GGCGATCCGT CAGGCTCGCA AGAAAGACCG CTGTCCGTTC TGTGGCGAGG TCCAGTACGA TATCGACCAC GAGAAGCCGA CGACCTACTA CGAGGTTCAG CAGGTCCTGA CCAGCGAGTA CTCCCAGCGC ATCGCCGGCG CGATGCAGGG GGACGAGGAG GCGGGCATCG AGCGGACCTC GCCGGACGAA CTCGCCGCGG AGACGGAGAT TGACCTCACC CGGATCAACG AGATCCTCTC GGGATCGTTC CGTCCGCGCG AGAGTCAGCG CGAGGCCATC GAGAAGGCAC TCGACATCGA CCTCACCGAG GAGGACACGA ACAAGCTGAT GCCAAGCGAC ATCCGGGACT GGTTCGAGAA CATTCCGGAC GAGGACATCG AAGTACTCGG CATTGATGCC GACCGCTCGC GGCCGGAGTG GATGATCCTC ACCGTCCTTC CGGTCCCGCC GGTCACTGCG CGGCCGTCGA TTACGCTCGA CAACGGCCAG CGCTCCGAGG ACGATCTTAC GCACAAACTG GTCGACATCA TCCGGATCAA CCAGCGGTTC ATGGAGAACC GTGAGGCAGG AGCGCCACAG CTGATCATCG AGGACCTCTG GGAACTGCTG CAGTACCACG TCACCACGTT CATGGACAAC GAAATCTCGG GGACGCCGCC GGCCCGACAC CGCTCCGGCC GGCCGCTCAA GACGCTCTCC CAGCGCCTGA AGGGCAAGGA GGGTCGATTC CGAGGCTCCC TGTCCGGGAA GCGTGTGAAC TTCTCGGCTC GTACCGTCAT CTCGCCGGAC CCGACCCTCT CGCTGAACGA GGTCGGTGTG CCAGACCGCG TCGCAAAGGA GATGACCCAG ACGATGAACG TCACCGACCG AAACGTCGAG GAGGCTCGAC GCTACGTTTC GAACGGTCCC GAGGGCCACC CTGGCGCGAA CTACGTCCGC CGACCCGACG GCCGCCGACT AAAGGTGACC GAGAAGAACT GCGAGGCACT TGCCGAGAAG GTCGACGCCG GCTGGGAAGT CAACCGGCAC ATGGTCGACG GCGACATCGT CATCTTCAAC CGTCAGCCGT CGCTTCACCG CATGTCGATC ATGGCCCACG AGGTCGTGGT CATGCCGTAC AAGACGTTCC GCCTCAACAC CGTCGTCTGT CCGCCGTACA ACGCCGACTT CGACGGTGAC GAGATGAACA TGCACGCGCT GCAGAACGAG GAAGCCCGTG CGGAAGCGCG CGTCCTCATG CGCGTCCAGG AGCAGATCCT TTCCCCGCGG TTCGGTGAGA ACATCATCGG AGCGATTCAG GACCACATCA GTGGGATGTA CCTCCTGACC CACGACAACC CGCGGTTCAA CGAAACGCAG GCACTCGACT TGCTCCGTGC AACGCGGATC GACGAACTGC CAGAGCCAAG CGGCATCGAC GACGAAGGTG AGCCGTTCTG GACCGGTCAC GACGTCTTCT CCGAACTCCT ACCCGACGAT CTGAACCTCG ACTTCACCGG CACCGTCGGC GAGAAGGTCA TCATCGAGGA CGGCCAGCTC ATCGAGGGCA CCATCGCCGA GGACGAAGTC GGCGAGTTCG GCGGCGAGAT CGTCGACACC ATCACGAAGA TCTACGGCAA CACCCGCTCG CGGATCTTCA TCAACGAGGT CTCGACGCTC GCGATGCGTG CCATCATGCA CTTCGGATTC TCGATCGGGA TCGACGACGA AACCATTCCG GGCGAGGCGC AGTCCCGAAT CGACGAAACG ATCGAGGACG CGAACGACCG CGTCGAGGAA CTCATCGAGG CCTACGAGAA CAACGAACTC GAGTCCCTCC CGGGCCGGAC GCTCGACGAG ACACTCGAGA TGAAGATCAT GCAGACGCTC TCGCGTGCGC GTGACAACGC TGGTAACATC GCAGACGAGC ACTTCGATGA CGAGAACCCG GCGGTTGTCA TGGCCAACTC CGGTGCGCGT GGGTCGATGC TCAACCTGAC GCAGATGGCC GGTGCAGTCG GTCAGCAGGC AGTTCGGGGC GAGCGAATCA ACCGCGGCTA CGAGGACCGC ACGCTCAGCC ACTACAAGCC AAACGACCTC TCTGCGGAGG CCCACGGCTT CGTGGAGAAC TCTTACACGT CGGGGCTGAC CCCGCGGGAA TTCTTCTTCC ACGCGATGGG TGGCCGCGAG GGTCTGGTCG ACACGGCAGT CCGGACCTCG AAGTCCGGCT ACCTGCAGCG TCGCCTGATT AACGCGCTCT CGGAACTCGA AGCGCAGTAC GACGGCACTG TTCGGGACAC CTCGGACACG ATCGTCCAGT TCGAGTTCGG CGAGGACGGC ACCTCGCCGG TGAAAGTCTC CTCGGGTGAC GACAACGACA TCGATGTCGA GCAGATCGCA GACCGTGTGC TCGACTCGGA GTTCGCCTCC GAGGCCGAGA AGGAAGAATT CCTCGGCACG AAGCGCCAGC CGACGAACCT CTCCGAGCAC GCCGATACTC GCTTTGTCGA GGAGACAGAG GAGGTGACCT CCGATGACTG A
|
Protein sequence | MRNSTPKDIG SINFGLMEPE EYREMSATKI ITADTYDDDG FPIDMGLMDP RLGVIDPGLE CKTCGKHSGS CNGHFGHIEL AAPVIHVGFT KLIRRLLRGT CRECSRLLLV EDEREEFQDQ IEESRKLGRD LNDVTKAAIR QARKKDRCPF CGEVQYDIDH EKPTTYYEVQ QVLTSEYSQR IAGAMQGDEE AGIERTSPDE LAAETEIDLT RINEILSGSF RPRESQREAI EKALDIDLTE EDTNKLMPSD IRDWFENIPD EDIEVLGIDA DRSRPEWMIL TVLPVPPVTA RPSITLDNGQ RSEDDLTHKL VDIIRINQRF MENREAGAPQ LIIEDLWELL QYHVTTFMDN EISGTPPARH RSGRPLKTLS QRLKGKEGRF RGSLSGKRVN FSARTVISPD PTLSLNEVGV PDRVAKEMTQ TMNVTDRNVE EARRYVSNGP EGHPGANYVR RPDGRRLKVT EKNCEALAEK VDAGWEVNRH MVDGDIVIFN RQPSLHRMSI MAHEVVVMPY KTFRLNTVVC PPYNADFDGD EMNMHALQNE EARAEARVLM RVQEQILSPR FGENIIGAIQ DHISGMYLLT HDNPRFNETQ ALDLLRATRI DELPEPSGID DEGEPFWTGH DVFSELLPDD LNLDFTGTVG EKVIIEDGQL IEGTIAEDEV GEFGGEIVDT ITKIYGNTRS RIFINEVSTL AMRAIMHFGF SIGIDDETIP GEAQSRIDET IEDANDRVEE LIEAYENNEL ESLPGRTLDE TLEMKIMQTL SRARDNAGNI ADEHFDDENP AVVMANSGAR GSMLNLTQMA GAVGQQAVRG ERINRGYEDR TLSHYKPNDL SAEAHGFVEN SYTSGLTPRE FFFHAMGGRE GLVDTAVRTS KSGYLQRRLI NALSELEAQY DGTVRDTSDT IVQFEFGEDG TSPVKVSSGD DNDIDVEQIA DRVLDSEFAS EAEKEEFLGT KRQPTNLSEH ADTRFVEETE EVTSDD
|
| |