Gene Nmag_4101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_4101 
Symbol 
ID8828835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013924 
Strand
Start bp144135 
End bp146495 
Gene Length2361 bp 
Protein Length786 aa 
Translation table11 
GC content59% 
IMG OID 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_003482184 
Protein GI289937582 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCGA CGCCGTTGAC GGATACCCTC CGCGAAACTC TCGCAACCTT CGAGGAAGGC 
GGGGCTCCTC AAACAACGAC CGAGGTCGCC GAACAGTTCG ACCTCGGTCG GCGGAGTGCC
TACGAACGAC TGGAGCGACT CGTCGACCAC GACCAACTCG AAACCAAGAA AGTCGGGGGA
AACGGGCGCG TCTGGTGGCG GCCAGCGACG GTCGGCGGAG CGACTACACA AGATGCGGTG
GATAGCGACC TCGGAAAAGT CTTTGATCGA ATCTCGGACA GTTTCTACGC CCTCGACGAA
GAACTTCGCT TTCGCTATCT GAACGACACC GCAGCAGACC TTTTTGGACT GGATGAGGGT
GCAATTGGGT CGGACATTCG TAACGAGAAC GTCACGGAGA CGTACGAGAA CGCACTGTGT
GAGGCTCTTG AAACACAAGA GCCCGTAACC TTCGAAGACT ACTATCCGCC GTTAGACAAG
TGGTTCCAGA ACGTCATCTA TTCCTCAGAA TCGGGTCTGT CGGTCTATAC CCGCAACATC
ACCGAGCGGA AGCGACGTGA ACAAGAACTG GCTCGCTACG AAACGATCGT CGAAACGAGT
CCGATCGGCA TCACGATCGT TGACAGTGAC GGCAAGATGC AGTTCGCCAA CGACCGCGCT
GAGGAAATAT ACGGTCGAAG CAAAGCCCAG ATCAACGACC TCAGCTTCGA CGATTCCGAC
TGGAACGAAG TCGACGTCGA CAGCACTCCC CTCCCAGACG ACGAAAAGCC CTTCCCGCAA
ATTATCGAGT CCAAGGATTC GGTGTTCGAT CATGTTAGCG GCGTGTCTCG TCCAGATGGA
GAGCGCGTCT GGGTCTCCGT CAACGGGGCA CCGGTCTACG ACGAGCGTGG GGAAATCGAG
AGTGTCGTGT TCAGTATCGA AGACGTCACT GAACGGCGGG AACGACAACG CGCTCTCGAG
GAGAGCGAAC GTCGCTACCG GACGCTCGCC GAAAACTTCC AGAACGGGCT CGTCACCCAG
TTTGACGAGG AGCTTCGGTA CACGCTCGCC GCGGGGCAGG CATTCGACTA CCTCCCCCAC
TCGCCGGACG ACGTCGAAGG TCAGCACCTC CACGAGGTAT GGGACACCGA CGTGGCCGAC
ACGATCGAAC CCGTCTACCA GACCGTCCTT GAAGGCGAGA AGCAGTCGAT CGAGGGCACG
TCCGAAGGGT GCGAGTGGAT TGTCCAGGTC GTCCCCCTCA CGGATCAGGA TGGCGAGATT
CACGGGGGAA CAGCAATAGC CCTGGACATC ACCGAGCGCA AAGAACGCGA ACGTGCGCTC
GAAGAATCCG AACGGCGGTA CCGGACGCTC GCCGAAAACT TCCCAAACGG TGCAGTTGGT
CTCTTCGACG ATGATCTGCG GTATACGGCC GTCGGCGGGG AGTTGCTGGA CACGGTCGGC
GTCTCTCCAG AAGATTGGGT TGGAAACAGT GTCTACGACA CCTACCCGGA CGAGCTCGTT
GAGGAGGTCA AACAATACTT CGAGGCCGCG CTGGAGGGCA AGATGAACTC GTTCGAGGTC
GAGTACCACA ATCGCAACCT GTTCGTGAAC ACCTTACCCG TCAGAAACGT CGACGATGAG
GTCTACGCAG GGATGCTCGT CGTCCAAGAC GTGACCGAAC GACGGGAGTA CCAGCGCAAA
CTCGAGGCGT CGAACGAACG CCTCGAACAG TTTGCCTACG CAGTTTCACA CGACCTCCAG
GAGCCACTCC GAATGGTGAC GAGCTACCTC CAGTTGCTCG ACCAGCGGTA CGGCGATGCG
CTCGGCGAAG ACGGCGAGGA GTTCATCGAG TTCGCCGTCG ACGGCGCCGA ACGAATGCGA
GCGATGATCG ATGGGTTACT CGAGTACTCG CGGGTCGAAA CGCAAGGGGA GCCGTTCGAG
CCGGTCGATC TCGACGCCAT TCTGGAAGAG GTCCGCGACG ACCTCCAGTT GCGGATCGCC
GAGACGGAAG CCGAGATTAC GGCCGACGAC CTGCCGACAG TGGCCGGTGA CGCCAGCCAG
TTGCGCCAGG TGTTCCAGAA CCTGTTGAAC AACGCGATCG AGTACAGCGG CGACGAGCCA
CCACGGATCG ACATCGACGC CGAGCGTGTG GGTGGCCAGT GGCAGATTAC AGTCAGTGAT
CACGGGATTG GGATCAATCC CGACAACCAG GATCGCGTGT TCGAAGTGTT CCAGCGACTC
CACACCAGCA ACGAGCACCC GGGAACTGGC ATCGGCCTGG CCCTCTGTCA ACGGATCGTC
GAACGCCACG ATGGTGAACT ATGGGTCGAG TCCGAACCCG GCGACGGCTC GGCGTTCTCG
TTCACACTCC CTGTACCGTG A
 
Protein sequence
MSSTPLTDTL RETLATFEEG GAPQTTTEVA EQFDLGRRSA YERLERLVDH DQLETKKVGG 
NGRVWWRPAT VGGATTQDAV DSDLGKVFDR ISDSFYALDE ELRFRYLNDT AADLFGLDEG
AIGSDIRNEN VTETYENALC EALETQEPVT FEDYYPPLDK WFQNVIYSSE SGLSVYTRNI
TERKRREQEL ARYETIVETS PIGITIVDSD GKMQFANDRA EEIYGRSKAQ INDLSFDDSD
WNEVDVDSTP LPDDEKPFPQ IIESKDSVFD HVSGVSRPDG ERVWVSVNGA PVYDERGEIE
SVVFSIEDVT ERRERQRALE ESERRYRTLA ENFQNGLVTQ FDEELRYTLA AGQAFDYLPH
SPDDVEGQHL HEVWDTDVAD TIEPVYQTVL EGEKQSIEGT SEGCEWIVQV VPLTDQDGEI
HGGTAIALDI TERKERERAL EESERRYRTL AENFPNGAVG LFDDDLRYTA VGGELLDTVG
VSPEDWVGNS VYDTYPDELV EEVKQYFEAA LEGKMNSFEV EYHNRNLFVN TLPVRNVDDE
VYAGMLVVQD VTERREYQRK LEASNERLEQ FAYAVSHDLQ EPLRMVTSYL QLLDQRYGDA
LGEDGEEFIE FAVDGAERMR AMIDGLLEYS RVETQGEPFE PVDLDAILEE VRDDLQLRIA
ETEAEITADD LPTVAGDASQ LRQVFQNLLN NAIEYSGDEP PRIDIDAERV GGQWQITVSD
HGIGINPDNQ DRVFEVFQRL HTSNEHPGTG IGLALCQRIV ERHDGELWVE SEPGDGSAFS
FTLPVP