Gene Nmag_0766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0766 
Symbol 
ID8823594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp765395 
End bp768700 
Gene Length3306 bp 
Protein Length1101 aa 
Translation table11 
GC content59% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003478913 
Protein GI289580447 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGAA CCGAACCAAT TATTGACGAC CGAAGTCAGG AAGAGATCTA CGAGACGCTT 
CGCCAGCGGG CACGCACATA TACCGAAACG TGGGATCCGG AGTCAGGCGA CGTCGGCCAG
ACGCTCGTTC GGATCTTCTC GTCGTTCGAA GCCGACGTAT GGAACCGCCT CAACGAGGTC
CCGGAAAAAC ACTTGCTCGG CTTTCTGGAT GCGCTGGACT TCGATCGGCG ACCCCCGCAA
GCAGCTCGTG TTCCGCTTTC GTTTGACGTC TCCAGAGATC TCGATCGAAA CGTGCCAATC
CCCGGGGGAA CGACCGCTAT CGCGGATCCG ACGGACGGAG AGGCACAACA GTTCGAACTT
CCACAGGAGG CCGGGTTCGA AGCCACACCT GCGTCGCTTA CCGACATCGT CGCGGTCGAT
CCGACCGCCG ACGCGATCGT CGATCACGAC ACTTTGCTGG AGAACGAACA GACACGGCTG
TTCGACGGCG AGCTGATACA GTCCCATGCA CTCTATCTCG AGAACGAATC CGCACTCAAT
GTAGCCGCTG GATCGACATT TACCGTCAGA GTCGATTCGC AGTCGGATCC CGAACCGATC
TTCGAAGAGA CGATCTGGGA GTATTACGGT GAAAACGAGG ACGGAGAACA CGGATGGCAC
CCCCTCAAGC GCGTCGTAGA CGACACACCG ATGTCCGACG ACGGCGGTGT GAAAGCCCTG
CAAGAACAGC TACAGGCAAA CTCCAGTACT GGGGACGACG ATTCACGCAC CCGTGACGGC
GTTGCCGAAC AGCGATTTCG GCTGCCCGGA GAACCGACGA TCCACGAGGT AAACGGGACT
GAGGGGCGGT GGCTGCGCTG TTCGCTACGC GACGGGACGG CTGTCCCGAC GTGGACTGTT
CGTTCGATAT CGATACAGGT GACGAGCAGC GATCGGGAGA CCGATTCGGA GACGGGTCTC
GACCCGGATA TGCTGCTGTC GAACGACGTC CCGGTCGCGC TCGACGACGA ACGATTGCAC
CCGTTCGGAC GGGTCCCACA CCCTCCGGTG ACGTTTTTCG TCGCTTGCGA GGAGGCGTTT
ACCAAGCCCG GTGGGACAGT CGAACTCGAG TTTACACCGC CGGCCGAATC CGAATCCGAG
ACCCCGCAGG ACTCGAGTTC GTCGGCCACC AGTGGCCGGG ATGGAGACGA TGGAGTTGGT
GTGACAGACA GTGACGCGAT GGCGGACGCG TTACCCCATC TCGACGAAGA GACGAACGTG
AATAGAGGAG TTCTGGACGG ACCACCCGAA GTCTCCTGGG AGTACTGGAA CGGAAACGGG
TGGATGCGGT TGGATTCCGT CGCAGACGAG ACCGAGGCAT TACGACACCC CGGTCGTGTT
CAGTTCGAGG TGCCGTCGGA TATCGAGCCG ACGACAGTTG CAGGTCACGA CAACGTCTGG
ATCCGTGCAC GCCTCGTCAG CGGGACGTAC GACCGGCCGT CGATCGAAGA TTCCGATGGA
CCGCTGTCGA CGTCTTTCAC AACGAGGCCC GATCCACCGG TTTACGGCGA TGTTATCGTC
GAGTACGAGC ACCTCGACCT CTCGTTCAAT ACGATCGTCC GGCAGAACAA CGGCGCATAC
AGTGACGACC TGACGAAACG AGAGTGGAGT TTCGATCCGT TCGTCTCGCT CCCCGACGAC
GCGCAGACGG TGTATTTCGG GTTCGACCGA CGACTGGAAA ACGGTCCGAT AAGTCTCTTT
TTCGTGATCG ACGACGCCAC GTATCCACAA CAGTTCGATC CAGGCGTCCA GTGGGAGTAT
TGTACTGTGG ACAACGAGCC GACCTGGAAC CAGATGGATG TTCGCGATCA GACCGCCGGT
CTGACCGAGC GAGGCATGGT CATGCTGACG TTTCCGACCC CAACGACGGA GGCTGAACTG
TTCGGTCGAC AACGTCACTG GATCAGGGCA CGGCTGACGA AAGACGAGTT TAGGACCCAC
CTCGATGTCG AGGACGAAGC CGTTGCGTTC CAGTCGGAAG GCGGGAAGAT CGGTAAGGGT
TCCGTCACCG ATCGAGATCG AGGGACGACC ACCCGCGCCC GTACCCACGA CAGCCAGTCG
TCAATCCCGG CGAACATGAC GACAGAACAA ACGACGCTTC CCCCCATTCT TTCAGGACTC
TATCCCAATA CGCAATGGGC GCACAACAAA ATCACCGTCG AGGACGAGAT ACTCGGTTCG
AGCGACGGAT CCCACGAACA GTCGTTCGCC TGCTCGCACG CACCAGTGAT AGACATCGAT
CTCTGGGTGG ACGCACTAGA TACCATGTCG GCGGGCGAGC GACGACGACT CCTGAACGAA
CGGCCGGCAG ACGTCAACCG AGAATACGAC TCTCGTGGTG AACTGAAGGC GTTCTGGGTT
CGATGGAACG CCGTCGACGA CTTTCTCGAG TCGGGACCAC AGGATCGCCA CTACGTCATC
AACCGGACGC TGGGGACAGT TCAGTTCGGT GACGGTGACA ACGGCAAGAT TCCACCGAGC
GGTCAGGACA ACATCAAGGC GACGTATACG ACTGGCGGCG GTAGCGATGG AAACGTCGGT
CCGTGGACGA TTACCGATCT CAAGAGTTCT ATCGCACTGG TCGATACCGT GACGAATCCG
ACGGCGGCCA ACGGCGGGAC TGACGTCGAA TCGACGGATA CGCTCGTCTC GCGATCGACG
AACCGCTTCC GACATCGTGG GCGGGCAGTA ACACCACGCG ATTACGAGCA GGTAGCAAAA
GGCGAGTTTC CGGAACTCGC TCGAGTGTCC TGCGTGACCG ATGCCGAGAA CTGCATAACC
GTCTATATCG TTCCGGACGC ACAGCGGGAA AAGCCCGTGC CGTCGATGGA ATTGAAACAC
GACGTTCGTC AGACGCTGTA CGAACGTGCG CCAGCAACGC TGGTTTCGGA TACAGACCGC
GATATCGTCG TTCGCGGTCC AAGTTACAGT GAACTCACCG TCCAGGCAAC GGTCCGAGCG
AGCAGTGTCA AGAGTGTTTC CCTGCTGAAA TCGACCATCG AAGACCGACT CGACGAGTTC
GTCCATCCAT TGACGGGAAA CAACGGTAAT GGATGGGAGT TCGGAACACT TCCATCACGG
AAATCGCTCG CCGACGTCGT TACCGGCGTC GATGCTGTCG AGGGGGTGTC GAACTTCGAC
GCGACGATCA CGGTCAACGA GGAACGGCGG TCGATCACCG ATCAGCAGCG AGTCGAAACG
CTGCCGAAAA ACACGCTGGT CTGTCACGGG TCACACGAGC TCAGCATCAC CATGACGGAG
GACTAA
 
Protein sequence
MNGTEPIIDD RSQEEIYETL RQRARTYTET WDPESGDVGQ TLVRIFSSFE ADVWNRLNEV 
PEKHLLGFLD ALDFDRRPPQ AARVPLSFDV SRDLDRNVPI PGGTTAIADP TDGEAQQFEL
PQEAGFEATP ASLTDIVAVD PTADAIVDHD TLLENEQTRL FDGELIQSHA LYLENESALN
VAAGSTFTVR VDSQSDPEPI FEETIWEYYG ENEDGEHGWH PLKRVVDDTP MSDDGGVKAL
QEQLQANSST GDDDSRTRDG VAEQRFRLPG EPTIHEVNGT EGRWLRCSLR DGTAVPTWTV
RSISIQVTSS DRETDSETGL DPDMLLSNDV PVALDDERLH PFGRVPHPPV TFFVACEEAF
TKPGGTVELE FTPPAESESE TPQDSSSSAT SGRDGDDGVG VTDSDAMADA LPHLDEETNV
NRGVLDGPPE VSWEYWNGNG WMRLDSVADE TEALRHPGRV QFEVPSDIEP TTVAGHDNVW
IRARLVSGTY DRPSIEDSDG PLSTSFTTRP DPPVYGDVIV EYEHLDLSFN TIVRQNNGAY
SDDLTKREWS FDPFVSLPDD AQTVYFGFDR RLENGPISLF FVIDDATYPQ QFDPGVQWEY
CTVDNEPTWN QMDVRDQTAG LTERGMVMLT FPTPTTEAEL FGRQRHWIRA RLTKDEFRTH
LDVEDEAVAF QSEGGKIGKG SVTDRDRGTT TRARTHDSQS SIPANMTTEQ TTLPPILSGL
YPNTQWAHNK ITVEDEILGS SDGSHEQSFA CSHAPVIDID LWVDALDTMS AGERRRLLNE
RPADVNREYD SRGELKAFWV RWNAVDDFLE SGPQDRHYVI NRTLGTVQFG DGDNGKIPPS
GQDNIKATYT TGGGSDGNVG PWTITDLKSS IALVDTVTNP TAANGGTDVE STDTLVSRST
NRFRHRGRAV TPRDYEQVAK GEFPELARVS CVTDAENCIT VYIVPDAQRE KPVPSMELKH
DVRQTLYERA PATLVSDTDR DIVVRGPSYS ELTVQATVRA SSVKSVSLLK STIEDRLDEF
VHPLTGNNGN GWEFGTLPSR KSLADVVTGV DAVEGVSNFD ATITVNEERR SITDQQRVET
LPKNTLVCHG SHELSITMTE D