Gene Nmag_1012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1012 
Symbol 
ID8823843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1031453 
End bp1034917 
Gene Length3465 bp 
Protein Length1154 aa 
Translation table11 
GC content65% 
IMG OID 
Productputative PAS/PAC sensor protein 
Protein accessionYP_003479158 
Protein GI289580692 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0819572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACCG CACGGACTGT CGACGCTCCC CCGCCGCAAC TGCTCGTCGT CGGGGCTACA 
CTTGCGGACG AGTTCGCTTC TCTCTCCGCC GAGAACGGCG GACGGCTCGC CGGTGCCGAC
ATCGAGTCGG TCCCGTCGTC GTCGGCTGCA CTCGAGTGGC TCGACCAGCA GCCGAACGCC
GACAGGGTCG ACTGCGTGGT GACAGCGGCC GATCTTCCCG ACGGGTCGGG ACTGGCGCTA
CTCGAGGCGA TTCGGGGGCG CAGCACAGGA GCCGGAGAGG GAGTGGGAAG GGGAGACTCG
ATAAGTAGCA GGCGAGACGC AACGAGTGGA GACCACAGCG AGATCCCCAT CGTCCTCTCG
CCGGCGGCAG GTGACGGGAG TGATGCGCTG GCGCGAGCAG CCGCCGCTGC CGGTAGTACC
GAGTACGTCC CACGTGTCGG TGAGACGAGA GCGGGCGAAG ATACGGAGAC AGATTCAGCT
CCTGCTGTGG ATACCAGCTC GGTCCAGAAT GACCCACTCC TGGCAGCCGT CGAGCGGGTA
CTGTCGCGGG TTGAGCGACG AGACCGACAC CGAGAGCAGG CTCGCCAGTT CGAGGCAATC
TTCGACGATC CCTCGGCGTA CGCCTGGGTG CTCGATTCCG ACGGAATCGT CCGACGGGCG
AACGAGGGTG CGCTCGCCGA CCTCGACGCG ACGCCGTCCG ACGTTCGCGG GCGCGAACTC
TGGTCGCTCG CCTCGTGGGG CCGATTCGAT ACCTGTCGCG ACACGATCGA ACAGGCCGTC
GAGACGGCTG CGAGCGGCCG GGTGGTACGT CGGGAGATAA CGCGCGAGCG GCCGAATAAC
GAGGTCGAGA GAACCGGAGG TGATGACGAG GCGACAGGAA ACGACCGCCA GACGCTCGAT
CTGACCGTCC GACCAGTCAC CGACGGCGAC CGCGTCACGA CGATCCTCGT CCGCGCAACG
GATGTCACCG AGCGCGCGGC ACTCGAGTCC GATCTGCGCG AGTCCGAGGA ACTTCATCGG
GTGACGCTCA ATCACATGAC CGATACGGTC CTCATCACGA ACGACGAGGG TGAGTTTACC
TACGTCTGCC CGAACGTGCA CTTCATTTTC AGCTACACGG ACGAGGAGAT CCACGAGATG
GGGTCGATCG ACGAACTCCT CGGTGCGGAC CTGTTCGACC GTGCGGAACT CGCCGAGGAC
GGCGTGCTCA CGGATATTGA GTGTACGGCG ACGGACAAGG CGGGTCGCGA GCATACCCTG
CTGGTCAACG TCCGCGAGGT CTCGATTCAG GACGGAACGC ATCTCTACAG CTGTCGCGAT
ATTACGACCC GCAAGCGCCG TGAGGAGGCG TTGACGGCGC TTCACCGCAC CGCTCGCGAG
TTGCTGTACG CCGAGACGGA TCGCGAAATC GCGGCCATCA CTGTCGACGA CGCGACGGAC
GTGCTCGACC TCGAGGCGAG TGCGATCTAC CTGTTCGACA CCGACGAGAA CGTGCTTCGT
CCGGCCGCTC GTTCGGAGTC GATGGCGGCG CTTCACGGTC CGCTCTCAGC CCAGCAGGTT
GGTCAGGGTA TCGTCGGCGA CGTCTTCGTC GACGGCGAGA GCCGCCTCCT GGCGGACGTT
CACGACTCGC CGCTACTCGC CGAGCCGACG ACGGAGATCA GAAGCGCCGC GTTCGTCCCG
CTCGGCGATC ACGGCGTCTT CGTCGCCGGC TCGCCCGAGG TGGGTGTCTT CGACGAGGTC
TCTGGCGAGG TGACTGACCT GCTTGCAACG ACAGCGGAGG CGGCGCTCGA CCGCGTCGAA
CGCGAGCGCA CGCTTCGGGA GCGCGACCGC GAACTCAAGC GTCAGAACCG CCAGCTGACC
AGTCTCAACC AGATCAACGA GATCATCCGC GAGATCGACC AGGAACTCGT CCAGGCCGAA
ACGCGCGACG AGATCGAACA CGGCGTCTGC GACCGGTTGA CGGCCACCGA CCGCTTCTCG
TTCGCCTGGA TCGGTACTGA CGATCCATCC GGGGAGCGAC TCGAGTCTCG TACCCACGGC
GGGACGGATC GCGGTCGGGA CTACCTCGAC AGCGTCTCGC TCTCGCTTCC CGAACAGCCC
GCAGAGTCCG CAGCGTCCGT ACAGCCCGCA GAGTCCGCAG AGCCCGCAGA CGCTGCACAG
CCAGCTGACG GAGCCACAGG AGACGAATCG ACAGCGACAA CAGAAACCGC AACTGCAACC
GCACCATCCG TCCACGGCCG CGAACCCGCC GTCAGAACTG CCGTCACACG CGAGGGAACG
GTCGTCGCAA ACGTCGTCGA CGACCTGCGC GAGCAGCCCT GGCGGAGCGA GGCCCTCGCT
CGCGAGTATC AGTCGGTCAT CAGCGTCCCA CTCTCTTACG ACGAGTTCTC ATACGGTGTC
CTGACCGTCT ACGCGGACCG ACCCGACGCC TTCGACGAGG TCACCCGCGC TGTCCTCACC
GAACTCGGCG AAACGATCGC CTCGGCCATC GCTGCGGTCG ACCGCAAGCG CGCACTTCTC
TCGAACGCGA ACACGCGACT CGAGTTCGAC GTCGCAGACG AGAACTTCGT TTTCACCCGT
CTCGCGACAC GCGTAGACTG TACGATCTCG TTCGACGGCG GCGTTCGCCA GCACGAGGAC
GGGGCGACGG TGTTCGCCTC GGTCGAGGGC GCGCCGGCAG CCGATGTGGC CGCTGCCGCG
ACAGAACTCG TCGCGGTCAC CGATGCACAG GTCGTCAGTG ACCATCGACG AGGGGGGTAC
GGGAGTGTAA ACGCGAACGC GAGCGCAACT GTGAGCGCGG ACGCGAACTC GAACTCGAAC
CCGAACCCGG ACGCGAACTC GAACTCGAAC TCGAACCCGA ACCCGGACGC GAACTCGAAC
TCGAACTCGA ACCCGAACCC GGTCTCGAAC TCGAACACGA GTACCGACCT ATCCGAGTCC
GACAGGGCCA ACGCCGACGA AACCGACAGC AACGCGAACG GTGAGCGCGG TGGGACGATC
AGGCTCGAAC TCGCTCGGCC GTTTCCGGCA CTCACGCTCG CAGATCATGG CGCGATTCTC
CGGAGTGTCC GGGCGACACC GGAGTCGACT CGCGTCGTCG TCGACGTTCC GGCGGATGTC
GAGACAGGTA GTGGCGCTGC TGGCGGTGTC GGCACCGGTG CGAGCACCGA TATCGTGACA
ACCGCCTTTT CCGATATCGA ACTCCGTTCG AAACGTCGCG TTGACCGGAC GACGCCGCGT
GATATTCGGG CAGAACTACT CGAGCGCCTA ACCGACAGGC AACTCGAGGT GGTCCAGCAC
GCATACTACA GCGGTTACTT CGAGTCGCCG CGCGAGCGCT CCGGTGAGGA GATTTCGTCG
ACACTTTCGA TTTCGCCGGC CGCGTTCTAT CGGCACCATC GGACGGTCCA GCGAAAGCTC
TTCACTGTGT TGTTCGACGA TCTTGGTATT TCGACACACA CGTAG
 
Protein sequence
MDTARTVDAP PPQLLVVGAT LADEFASLSA ENGGRLAGAD IESVPSSSAA LEWLDQQPNA 
DRVDCVVTAA DLPDGSGLAL LEAIRGRSTG AGEGVGRGDS ISSRRDATSG DHSEIPIVLS
PAAGDGSDAL ARAAAAAGST EYVPRVGETR AGEDTETDSA PAVDTSSVQN DPLLAAVERV
LSRVERRDRH REQARQFEAI FDDPSAYAWV LDSDGIVRRA NEGALADLDA TPSDVRGREL
WSLASWGRFD TCRDTIEQAV ETAASGRVVR REITRERPNN EVERTGGDDE ATGNDRQTLD
LTVRPVTDGD RVTTILVRAT DVTERAALES DLRESEELHR VTLNHMTDTV LITNDEGEFT
YVCPNVHFIF SYTDEEIHEM GSIDELLGAD LFDRAELAED GVLTDIECTA TDKAGREHTL
LVNVREVSIQ DGTHLYSCRD ITTRKRREEA LTALHRTARE LLYAETDREI AAITVDDATD
VLDLEASAIY LFDTDENVLR PAARSESMAA LHGPLSAQQV GQGIVGDVFV DGESRLLADV
HDSPLLAEPT TEIRSAAFVP LGDHGVFVAG SPEVGVFDEV SGEVTDLLAT TAEAALDRVE
RERTLRERDR ELKRQNRQLT SLNQINEIIR EIDQELVQAE TRDEIEHGVC DRLTATDRFS
FAWIGTDDPS GERLESRTHG GTDRGRDYLD SVSLSLPEQP AESAASVQPA ESAEPADAAQ
PADGATGDES TATTETATAT APSVHGREPA VRTAVTREGT VVANVVDDLR EQPWRSEALA
REYQSVISVP LSYDEFSYGV LTVYADRPDA FDEVTRAVLT ELGETIASAI AAVDRKRALL
SNANTRLEFD VADENFVFTR LATRVDCTIS FDGGVRQHED GATVFASVEG APAADVAAAA
TELVAVTDAQ VVSDHRRGGY GSVNANASAT VSADANSNSN PNPDANSNSN SNPNPDANSN
SNSNPNPVSN SNTSTDLSES DRANADETDS NANGERGGTI RLELARPFPA LTLADHGAIL
RSVRATPEST RVVVDVPADV ETGSGAAGGV GTGASTDIVT TAFSDIELRS KRRVDRTTPR
DIRAELLERL TDRQLEVVQH AYYSGYFESP RERSGEEISS TLSISPAAFY RHHRTVQRKL
FTVLFDDLGI STHT