Gene Nmag_3220 details


Gene Information

Locus tag: Nmag_3220
Symbol:
ID: 8826083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 3338456
End bp: 3341290
Gene Length: 2835 bp
Protein Length: 944 aa
Translation table: 11
GC content: 63%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003481332
Protein GI: 289582866
COG category:
COG ID:
TIGRFAM ID:
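
The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3338456
END_BP = 3341290


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp):
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2835 944
```

Both results match the Gene Length and Protein Length values listed in the record.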


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0653262
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTCGC GCACGCAAGA GCGCGTCGAG CAGTGGGATT CTCGCCCGTT CAGCGGTGGT 
TTCGATGACC TGTCTACTCT CGCCGACAGT GACTTTTCGG GAGCAGTCGC TGCATCTGGC
GCGTGGCTGT TTATGCTTAA CGGCCGCGTC GTTGGCATCA TCGACGGTGA AATAACTGAC
TTCGAGGACG CAGACGGGAC CGCTTACCGC GCCCCGGACC CGGCACTGCC GTTGCTCTGT
ACGATGGAAG AGCGCGGCGG CGACGTACAG GCGAAGTACT ATACGAACAA AACGCCGCTT
CGCGAAGTCG ATCAGACGCT CCAGAACGGT TCGTTTACGG GCTACATCGA ACTGAGCGAG
AACGTTCTCA GTGGGGATTA CTACGCCATC TACTACGGCG GTCGGCGGAT GGCAGCAGCC
TACATCGGCA ACGCCGAGCG GTTGCTCACC GGTGACGAAG CGTTCGAGCG GGCCGACGAC
GAGGTCGGCA TTTATCAGGT CCTCAACGTC GATATCGATG TCACCGACAT CCCCGGCACA
GGCGCGTCCG AGGAGACGGG ACGGACAGGA ACGACCAACG ACGAAGACAC GGCTGAAACT
GGGCAAGGCA CAGGAGTTGG TGACGCAAGC CACACCAGTG ACGCTGCCGG CAGTGTCGAC
GACGCCGGAA CACGGGCCGA ACCGAGCCTC GACCCTACCG AGTCGGCGAT CGACCAACTC
GACGTTTCGG ACGGTACGGG CGCGAGCGCA GGAACGGGAG CAGACACTGA CATCGACACA
GCTCTCGACA CAGAGCTACC CTCGGGCGGT GACGCTGGAG TCGGGGCCGA AACCGGAAGC
GAGGCTGGAA CTGGCTCCGG AATCACGACC GACGACTCGA GTACCATCAC CGAGGCCGAG
ACGGACGGGC CGGGTGGAAT CACTGCACCA GTGGCATCTG ACGACGAGTC GGCGTCGACG
GAACAGAGGG AGGTCGGCGC GGAGGCTGAC GCTGAGCGCG AGTCCGACGC CGGTACGCAT
GCGCCGGCCA GCGCGGGCGA AGAGGTGACG GGAGCGGCCG AGGTCGAAGA CAGTCCCAGC
GACGGTACAA ACCGGGCGGG CATGTCGAGC CCCGATCCGG AAACGGTCGA AGCGGCGGCA
GAAGAACTCG AACAGAACGA TATCTCCTGG ATCGAAGAGG ACGGTGACGG TGCGGACGCC
GGTGCTGGCG ATGCCAGCGG TGTGTCGGAG GCGACGCCGG CACCGCCAGT CGAGGCAGAG
ACGGAGCCAG GGCCGGCAGA CGGTGACACA GAAACAGCCA CAGCCACAGC CACAGAAACA
GCCACGGCCG CAGCCACAGA CACAGCCACG GTCGCAGACG CGAACAGCGA CACAACGGCC
CGTGACGAGA CCGATCCCGA CGGAAACACG GACGATGACG AGGACGGCGA ACTCGATCAG
CAACTCGAGG CCGAAGAGGC CTGGCGAGAA ACCCGATCGA TTCCGTCGAT CGATCCGGAC
AATTCGGCGG CGAGTGAGGA TGGGGCGCGC ACTGGTGCAC CCTCGCGTGC AGCCAGTAGT
CGGTCGCGAT CGCAATCACA GTCACAGTCA CAGTCACAGT CACAGTCACA GTCACGGTCG
TCAACAGCAT CTTCCAAACC GAGCACGCAG TCCCAGTCAT CAGCTGACCA GTCGGGATCG
AACCAGCGTC GCGCTACCAA CGCGGATCGA TCCAACACCA GTTCGAACGC CGGTGGCGAT
TCCCGCAGTA CAGACTCTCC ACAGTCACAC TCGCAGACAT CGTCTCGCCA GCGCAGCGAC
GGACAGAACA GTACCGAACA GAACACGTCC GGAAGCGGTG CAGGCGACGC ACACAAACGC
AACCTCGCCC AGTACACCCG CCGGATCGAG GAACTCGAAC AGAAACACAA CGCCCTCGCA
GAGAAGGCCA GGGAGCTCAA AACCGAACGT GACGAACTGC ACGCCGAAAA TCAACAGCTC
ACGAGCACGG TCGAGTCGCT CCAGACGCGA GTGAGTGAAC TCGAGTCCGA ACTGGAACAG
GCACGGGCTG GTGGCGGTGG CAGTGGCAGT GGCGGTGACG GTGACAGCCA GTCCGGGTTT
GACGCCGAAA CGCACCTCTC GCCGACGGAG GCACTGTCGG GAACGAACCT CTTCGTGCGC
TACGACTCGA AGAGCCAGCC AACACTCGAG ACGGCCCACG ATGGCGCAGC GGATCGCAAC
GAGGTTGCCT CGAATCTCCG GCTCGAACAC CACACCGAGT TCGACACAGA GACGGTCGCT
GTCGACGGCC AGCCGTACGA ACGTTTCCTC ACTGAGACTA TCGAGTACTC GTTCGTCGAC
TGGCTCACCG AGATGGTGCT GTACGAGATT CGCGATACGG GACACGCGGA CGGGCTAGCG
GATCTCTACG ACGCGATTCC GCGAATCGAC CGCGCAGAAC TGGGAGCGAC GATCTCGCTC
GCGGATGACG ACACCGAAGA CGTCCCTGAC GAGGTCACCT TCGATGTTGT CGCGTTCGAC
AAGATGGGCA ACCCGCTCGT TCTCGCGACA CTCAACGACT CGCGCGAGCC CGCGAGCCAG
GCGCTACTCG AGGAACTCGA GGTTGCGGCC TCGGCGGTCA AGGCGAACTA TCCGGATCTT
GCAGCGGCAG TCGCGGTGAC CTCGAGTTAC TTCGAGCCGG GTGCACTCGA GGTGACAGAG
CAGGCGACGA GCAGCGGATT CTTGAGCCGC GGTTCGAAGC TGAGTTACGT GAACCTCTCG
CGCAAGAGTG GCTATCATCT CTGTCTGGTG GAGTCACGCA GTGAAGGGTT CCATATGAAC
GTGCCGGAGT TGTGA
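
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-content calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(sequence):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First line of the gene sequence above, in its original block layout.
first_line = "ATGGACTCGC GCACGCAAGA GCGCGTCGAG CAGTGGGATT CTCGCCCGTT CAGCGGTGGT"
print(len(clean_sequence(first_line)), gc_percent(first_line))
```

Passing the full 2835 bp sequence to `gc_percent` would allow a comparison with the GC content field of the record.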
 
Protein sequence
MDSRTQERVE QWDSRPFSGG FDDLSTLADS DFSGAVAASG AWLFMLNGRV VGIIDGEITD 
FEDADGTAYR APDPALPLLC TMEERGGDVQ AKYYTNKTPL REVDQTLQNG SFTGYIELSE
NVLSGDYYAI YYGGRRMAAA YIGNAERLLT GDEAFERADD EVGIYQVLNV DIDVTDIPGT
GASEETGRTG TTNDEDTAET GQGTGVGDAS HTSDAAGSVD DAGTRAEPSL DPTESAIDQL
DVSDGTGASA GTGADTDIDT ALDTELPSGG DAGVGAETGS EAGTGSGITT DDSSTITEAE
TDGPGGITAP VASDDESAST EQREVGAEAD AERESDAGTH APASAGEEVT GAAEVEDSPS
DGTNRAGMSS PDPETVEAAA EELEQNDISW IEEDGDGADA GAGDASGVSE ATPAPPVEAE
TEPGPADGDT ETATATATET ATAAATDTAT VADANSDTTA RDETDPDGNT DDDEDGELDQ
QLEAEEAWRE TRSIPSIDPD NSAASEDGAR TGAPSRAASS RSRSQSQSQS QSQSQSQSRS
STASSKPSTQ SQSSADQSGS NQRRATNADR SNTSSNAGGD SRSTDSPQSH SQTSSRQRSD
GQNSTEQNTS GSGAGDAHKR NLAQYTRRIE ELEQKHNALA EKARELKTER DELHAENQQL
TSTVESLQTR VSELESELEQ ARAGGGGSGS GGDGDSQSGF DAETHLSPTE ALSGTNLFVR
YDSKSQPTLE TAHDGAADRN EVASNLRLEH HTEFDTETVA VDGQPYERFL TETIEYSFVD
WLTEMVLYEI RDTGHADGLA DLYDAIPRID RAELGATISL ADDDTEDVPD EVTFDVVAFD
KMGNPLVLAT LNDSREPASQ ALLEELEVAA SAVKANYPDL AAAVAVTSSY FEPGALEVTE
QATSSGFLSR GSKLSYVNLS RKSGYHLCLV ESRSEGFHMN VPEL
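
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments (translation table 11 uses the same assignments as the standard code and differs in its permitted start codons); the `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence above.
print(translate("ATGGACTCGC GCACGCAAGA GCGCGTCGAG"))  # MDSRTQERVE
print(translate("GTGCCGGAGT TGTGA"))                  # VPEL
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above.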