Gene Nmag_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2044 
Symbol 
ID8824887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2082974 
End bp2084047 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content64% 
IMG OID 
Producttranscriptional regulator, HxlR family 
Protein accessionYP_003480176 
Protein GI289581710 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCACC CACCGACCCC GGCAGAGTCG ACCGCATTGT CGATCCTCGG CACCAAGTGG 
AAACCCCGGC TGATCGTTGC CCTCGCGACC AACGACCGGC TCAGCTTCGG CGATCTGAAA
CGCGAACTCA CGGGTATCTC GGGGAAGGTG CTGTCGGAGA ACCTGGACGA ACTGCGCGAT
CACGGCGTCG TTTCCCGCGA CGTCGTGCAA CAACAGCCTC GGCGCGTCGA GTACGAACTA
ACCGGGGCCG GGCGAGAGCT GTACCAGCTC ATCGAAGCAC TCACAGAGTG GGATGCGACG
TACGCAACCG AACGTGGTGT GCCGACAGTC CTCCTCGCCG AAGACGATCC GCGCCTGCGA
GAGCTCTATG CACTGTGGTT GCAAACCGAC TACGACGTAC TGACAGTCCC CGACGGTCAG
ACAGCACTCC GCTCCCTCGA CGAGTCAGTC GACGTGGCAG TCCTCGCTCG CGATCTGCCG
ACACTCGACG GGGCCGCGGT CGCAGCCGCA CTCGAGACGG CCGGGCAGCG AACGCCGGTC
GCGATCATCA CGTCGGCAGA CATCTCGCCG GAGGACGTCT CGATCTCGGC AGATCTGTTA
GTTCGAGATC CGCTCTCCAA AGCCGAGTTG ATCGACACCG TCGAACAGCT CACACGGCTT
CCGAAGGAGT CACCGATTGG CCGGGATATT CGTGCTCGCC GCCATCGGCT GGCGTTCGTC
GAGCGCCACC TCGGGCCGAC GGTCTCAGAG ACGGAGCCCT ATCAGCGGGC TGCGGACGAA
CTGACGGCAC TCGAGCAGGA ACGAGAGCGG ACAGCCGACG CGAGAGCGCC GTGGCGGCGG
CTGAGACGGG GAAACGGAGC GGAGTCGGAT GCGTCGGGTC GAGCAAAGCG GCGTGAATAC
GAAGCGCGGG AGCGGGGACA GGCGAATCAA GAACGAGAAC GAGCACAAGC ACAAAAACGG
AACCGAGACC GAGAGCGAAA ACGCGACCGC AACTCCAAAC GAGATCGGGC GGCTGAGAAA
GACCGCGATC ACAACCGAAC CCACGACGAC AGTGACGGGG ATGGGAACGA ATGA
 
Protein sequence
MSHPPTPAES TALSILGTKW KPRLIVALAT NDRLSFGDLK RELTGISGKV LSENLDELRD 
HGVVSRDVVQ QQPRRVEYEL TGAGRELYQL IEALTEWDAT YATERGVPTV LLAEDDPRLR
ELYALWLQTD YDVLTVPDGQ TALRSLDESV DVAVLARDLP TLDGAAVAAA LETAGQRTPV
AIITSADISP EDVSISADLL VRDPLSKAEL IDTVEQLTRL PKESPIGRDI RARRHRLAFV
ERHLGPTVSE TEPYQRAADE LTALEQERER TADARAPWRR LRRGNGAESD ASGRAKRREY
EARERGQANQ ERERAQAQKR NRDRERKRDR NSKRDRAAEK DRDHNRTHDD SDGDGNE