Gene Nmag_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1121 
Symbol 
ID8823952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1141293 
End bp1142561 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content64% 
IMG OID 
Productarsenite-activated ATPase ArsA 
Protein accessionYP_003479267 
Protein GI289580801 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGGAA TCGACGTCGA AGTCGTCGAC GAGGACGACG CGAATACTGC AGCAGACGGC 
GGGGACGAGG GCGATGGCGA GTCGGACAAC ACCATCGAGG TCACGCCCAC CAGTTCGACC
GACTCCGCAG ACGTAGAGCC GGACGAAGAC CGCACAACCA TCGATGTCGA ACCGTCCGAC
GAGCCAATCG ACGGCCCCGA CTATATCCTC TACGGCGGCA AGGGTGGCGT CGGCAAGACG
ACCATGGCCG CGGCCACCGC ACTCGACAGC GCCCGCGGCG GCACCAGTAC GCTCGTCGTC
TCGACGGACC CCGCCCACTC CCTCTCTGAC ACCTTCGAGA GGGACGTCCC CGCAGAACCC
GCCCGGCTGC GCGAGGATAT CCCACTGTAC GGCGCTGAAA TTGACCCCGA GGCGGCCGCT
GAGCGTGGAC AGGCCGTCTT TGGGAGTAAC GCGAGTGCAG ACTCAGACAC TGACCCCGAG
TGGGAAGCCA ATGGACTCGG CGACGACGGC TTCGGCGGCG ACAGCAGCCC CTTCGGCGAC
GACCAGGGCG GCCTCGGCGG CATCGGCCAA CTCCTCGGCG ATGACAATCC CATGGACGCC
CTCTTCGGCG GTTCCATGCC AGGTGCCGAC GAAGCTGCCG CGATGCAGTT GCTACTCGAG
TACATGGATG ACCCGCGATT CGAGCGCGTC GTCATCGACA CCGCGCCGAC GGGTCACACC
CTTAGACTGC TCCAGTTACC CGAAATCATG GACTCGATGG TCGGGAAAAT TCTGCAGTTC
CGCCAGCGTA TGAGCGGCCT CTTCGAGGGA ATGAAGGGGA TGTTCGGCGG ACAGGACCAG
CCGGCAGACC AGACGCCGGA TCTCTCCGAT CTAGACGAAC TCCAGGAGCG GATCGAGCGC
CTGCGGGCGG CGCTCCAGGA TCCGACACGG ACGGACTTCC GGATCGTGAT GATTCCCGAG
GAGATGAGCG TCTACGAGTC GACGCGGCTG CGCCAGCAGC TTCAGGAGTT CGACATTCCG
GTCGGTACGG TCGTCGTCAA CCGTGTGATG GAGCCGCTGT CGAACGTAAC CGACGACGTG
CGCGGCGAGT TCTTACAGCC GAATCTGGAC GACTGTGAGT TCTGCCAACG GCGCTGGGAT
GTCCAGCAGT CCGCCCTTGC TGAGGCACAG GACCTCTTTC GCGGACCGGA CGTTCGGCGC
GTCCCGCTGT TCGCGGATGA AGTCCGTGGT GAGGGCATGC TCGAGGTCGT GGCGGCCTGT
CTGCGATAA
 
Protein sequence
MSGIDVEVVD EDDANTAADG GDEGDGESDN TIEVTPTSST DSADVEPDED RTTIDVEPSD 
EPIDGPDYIL YGGKGGVGKT TMAAATALDS ARGGTSTLVV STDPAHSLSD TFERDVPAEP
ARLREDIPLY GAEIDPEAAA ERGQAVFGSN ASADSDTDPE WEANGLGDDG FGGDSSPFGD
DQGGLGGIGQ LLGDDNPMDA LFGGSMPGAD EAAAMQLLLE YMDDPRFERV VIDTAPTGHT
LRLLQLPEIM DSMVGKILQF RQRMSGLFEG MKGMFGGQDQ PADQTPDLSD LDELQERIER
LRAALQDPTR TDFRIVMIPE EMSVYESTRL RQQLQEFDIP VGTVVVNRVM EPLSNVTDDV
RGEFLQPNLD DCEFCQRRWD VQQSALAEAQ DLFRGPDVRR VPLFADEVRG EGMLEVVAAC
LR