Gene Information |
Locus tag | Nmag_1012 |
Symbol | |
ID | 8823843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natrialba magadii ATCC 43099 |
Kingdom | Archaea |
Replicon accession | NC_013922 |
Strand | + |
Start bp | 1031453 |
End bp | 1034917 |
Gene Length | 3465 bp |
Protein Length | 1154 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_003479158 |
Protein GI | 289580692 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0819572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATACCG CACGGACTGT CGACGCTCCC CCGCCGCAAC TGCTCGTCGT CGGGGCTACA CTTGCGGACG AGTTCGCTTC TCTCTCCGCC GAGAACGGCG GACGGCTCGC CGGTGCCGAC ATCGAGTCGG TCCCGTCGTC GTCGGCTGCA CTCGAGTGGC TCGACCAGCA GCCGAACGCC GACAGGGTCG ACTGCGTGGT GACAGCGGCC GATCTTCCCG ACGGGTCGGG ACTGGCGCTA CTCGAGGCGA TTCGGGGGCG CAGCACAGGA GCCGGAGAGG GAGTGGGAAG GGGAGACTCG ATAAGTAGCA GGCGAGACGC AACGAGTGGA GACCACAGCG AGATCCCCAT CGTCCTCTCG CCGGCGGCAG GTGACGGGAG TGATGCGCTG GCGCGAGCAG CCGCCGCTGC CGGTAGTACC GAGTACGTCC CACGTGTCGG TGAGACGAGA GCGGGCGAAG ATACGGAGAC AGATTCAGCT CCTGCTGTGG ATACCAGCTC GGTCCAGAAT GACCCACTCC TGGCAGCCGT CGAGCGGGTA CTGTCGCGGG TTGAGCGACG AGACCGACAC CGAGAGCAGG CTCGCCAGTT CGAGGCAATC TTCGACGATC CCTCGGCGTA CGCCTGGGTG CTCGATTCCG ACGGAATCGT CCGACGGGCG AACGAGGGTG CGCTCGCCGA CCTCGACGCG ACGCCGTCCG ACGTTCGCGG GCGCGAACTC TGGTCGCTCG CCTCGTGGGG CCGATTCGAT ACCTGTCGCG ACACGATCGA ACAGGCCGTC GAGACGGCTG CGAGCGGCCG GGTGGTACGT CGGGAGATAA CGCGCGAGCG GCCGAATAAC GAGGTCGAGA GAACCGGAGG TGATGACGAG GCGACAGGAA ACGACCGCCA GACGCTCGAT CTGACCGTCC GACCAGTCAC CGACGGCGAC CGCGTCACGA CGATCCTCGT CCGCGCAACG GATGTCACCG AGCGCGCGGC ACTCGAGTCC GATCTGCGCG AGTCCGAGGA ACTTCATCGG GTGACGCTCA ATCACATGAC CGATACGGTC CTCATCACGA ACGACGAGGG TGAGTTTACC TACGTCTGCC CGAACGTGCA CTTCATTTTC AGCTACACGG ACGAGGAGAT CCACGAGATG GGGTCGATCG ACGAACTCCT CGGTGCGGAC CTGTTCGACC GTGCGGAACT CGCCGAGGAC GGCGTGCTCA CGGATATTGA GTGTACGGCG ACGGACAAGG CGGGTCGCGA GCATACCCTG CTGGTCAACG TCCGCGAGGT CTCGATTCAG GACGGAACGC ATCTCTACAG CTGTCGCGAT ATTACGACCC GCAAGCGCCG TGAGGAGGCG TTGACGGCGC TTCACCGCAC CGCTCGCGAG TTGCTGTACG CCGAGACGGA TCGCGAAATC GCGGCCATCA CTGTCGACGA CGCGACGGAC GTGCTCGACC TCGAGGCGAG TGCGATCTAC CTGTTCGACA CCGACGAGAA CGTGCTTCGT CCGGCCGCTC GTTCGGAGTC GATGGCGGCG CTTCACGGTC CGCTCTCAGC CCAGCAGGTT GGTCAGGGTA TCGTCGGCGA CGTCTTCGTC GACGGCGAGA GCCGCCTCCT GGCGGACGTT CACGACTCGC CGCTACTCGC CGAGCCGACG ACGGAGATCA GAAGCGCCGC GTTCGTCCCG CTCGGCGATC ACGGCGTCTT CGTCGCCGGC TCGCCCGAGG TGGGTGTCTT CGACGAGGTC TCTGGCGAGG TGACTGACCT GCTTGCAACG ACAGCGGAGG CGGCGCTCGA CCGCGTCGAA 
CGCGAGCGCA CGCTTCGGGA GCGCGACCGC GAACTCAAGC GTCAGAACCG CCAGCTGACC AGTCTCAACC AGATCAACGA GATCATCCGC GAGATCGACC AGGAACTCGT CCAGGCCGAA ACGCGCGACG AGATCGAACA CGGCGTCTGC GACCGGTTGA CGGCCACCGA CCGCTTCTCG TTCGCCTGGA TCGGTACTGA CGATCCATCC GGGGAGCGAC TCGAGTCTCG TACCCACGGC GGGACGGATC GCGGTCGGGA CTACCTCGAC AGCGTCTCGC TCTCGCTTCC CGAACAGCCC GCAGAGTCCG CAGCGTCCGT ACAGCCCGCA GAGTCCGCAG AGCCCGCAGA CGCTGCACAG CCAGCTGACG GAGCCACAGG AGACGAATCG ACAGCGACAA CAGAAACCGC AACTGCAACC GCACCATCCG TCCACGGCCG CGAACCCGCC GTCAGAACTG CCGTCACACG CGAGGGAACG GTCGTCGCAA ACGTCGTCGA CGACCTGCGC GAGCAGCCCT GGCGGAGCGA GGCCCTCGCT CGCGAGTATC AGTCGGTCAT CAGCGTCCCA CTCTCTTACG ACGAGTTCTC ATACGGTGTC CTGACCGTCT ACGCGGACCG ACCCGACGCC TTCGACGAGG TCACCCGCGC TGTCCTCACC GAACTCGGCG AAACGATCGC CTCGGCCATC GCTGCGGTCG ACCGCAAGCG CGCACTTCTC TCGAACGCGA ACACGCGACT CGAGTTCGAC GTCGCAGACG AGAACTTCGT TTTCACCCGT CTCGCGACAC GCGTAGACTG TACGATCTCG TTCGACGGCG GCGTTCGCCA GCACGAGGAC GGGGCGACGG TGTTCGCCTC GGTCGAGGGC GCGCCGGCAG CCGATGTGGC CGCTGCCGCG ACAGAACTCG TCGCGGTCAC CGATGCACAG GTCGTCAGTG ACCATCGACG AGGGGGGTAC GGGAGTGTAA ACGCGAACGC GAGCGCAACT GTGAGCGCGG ACGCGAACTC GAACTCGAAC CCGAACCCGG ACGCGAACTC GAACTCGAAC TCGAACCCGA ACCCGGACGC GAACTCGAAC TCGAACTCGA ACCCGAACCC GGTCTCGAAC TCGAACACGA GTACCGACCT ATCCGAGTCC GACAGGGCCA ACGCCGACGA AACCGACAGC AACGCGAACG GTGAGCGCGG TGGGACGATC AGGCTCGAAC TCGCTCGGCC GTTTCCGGCA CTCACGCTCG CAGATCATGG CGCGATTCTC CGGAGTGTCC GGGCGACACC GGAGTCGACT CGCGTCGTCG TCGACGTTCC GGCGGATGTC GAGACAGGTA GTGGCGCTGC TGGCGGTGTC GGCACCGGTG CGAGCACCGA TATCGTGACA ACCGCCTTTT CCGATATCGA ACTCCGTTCG AAACGTCGCG TTGACCGGAC GACGCCGCGT GATATTCGGG CAGAACTACT CGAGCGCCTA ACCGACAGGC AACTCGAGGT GGTCCAGCAC GCATACTACA GCGGTTACTT CGAGTCGCCG CGCGAGCGCT CCGGTGAGGA GATTTCGTCG ACACTTTCGA TTTCGCCGGC CGCGTTCTAT CGGCACCATC GGACGGTCCA GCGAAAGCTC TTCACTGTGT TGTTCGACGA TCTTGGTATT TCGACACACA CGTAG
|
Protein sequence | MDTARTVDAP PPQLLVVGAT LADEFASLSA ENGGRLAGAD IESVPSSSAA LEWLDQQPNA DRVDCVVTAA DLPDGSGLAL LEAIRGRSTG AGEGVGRGDS ISSRRDATSG DHSEIPIVLS PAAGDGSDAL ARAAAAAGST EYVPRVGETR AGEDTETDSA PAVDTSSVQN DPLLAAVERV LSRVERRDRH REQARQFEAI FDDPSAYAWV LDSDGIVRRA NEGALADLDA TPSDVRGREL WSLASWGRFD TCRDTIEQAV ETAASGRVVR REITRERPNN EVERTGGDDE ATGNDRQTLD LTVRPVTDGD RVTTILVRAT DVTERAALES DLRESEELHR VTLNHMTDTV LITNDEGEFT YVCPNVHFIF SYTDEEIHEM GSIDELLGAD LFDRAELAED GVLTDIECTA TDKAGREHTL LVNVREVSIQ DGTHLYSCRD ITTRKRREEA LTALHRTARE LLYAETDREI AAITVDDATD VLDLEASAIY LFDTDENVLR PAARSESMAA LHGPLSAQQV GQGIVGDVFV DGESRLLADV HDSPLLAEPT TEIRSAAFVP LGDHGVFVAG SPEVGVFDEV SGEVTDLLAT TAEAALDRVE RERTLRERDR ELKRQNRQLT SLNQINEIIR EIDQELVQAE TRDEIEHGVC DRLTATDRFS FAWIGTDDPS GERLESRTHG GTDRGRDYLD SVSLSLPEQP AESAASVQPA ESAEPADAAQ PADGATGDES TATTETATAT APSVHGREPA VRTAVTREGT VVANVVDDLR EQPWRSEALA REYQSVISVP LSYDEFSYGV LTVYADRPDA FDEVTRAVLT ELGETIASAI AAVDRKRALL SNANTRLEFD VADENFVFTR LATRVDCTIS FDGGVRQHED GATVFASVEG APAADVAAAA TELVAVTDAQ VVSDHRRGGY GSVNANASAT VSADANSNSN PNPDANSNSN SNPNPDANSN SNSNPNPVSN SNTSTDLSES DRANADETDS NANGERGGTI RLELARPFPA LTLADHGAIL RSVRATPEST RVVVDVPADV ETGSGAAGGV GTGASTDIVT TAFSDIELRS KRRVDRTTPR DIRAELLERL TDRQLEVVQH AYYSGYFESP RERSGEEISS TLSISPAAFY RHHRTVQRKL FTVLFDDLGI STHT
|
| |
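The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions agree with the values listed here, but the record does not state them. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Nmag_1012.
length_bp = gene_length(1031453, 1034917)
print(length_bp)                  # 3465, matches "Gene Length"
print(protein_length(length_bp))  # 1154, matches "Protein Length"
```

Both results match the Gene Length and Protein Length fields above, so the record is internally consistent under these assumptions.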