Gene Nmag_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3901 
Symbol 
ID8826771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp297358 
End bp298647 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content63% 
IMG OID 
Productpeptidase M24 
Protein accessionYP_003482004 
Protein GI289583594 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.337713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTACC TATCCATGTC GTTTCACGAC CGGCAGTTTA TGGCGGGTAC TCGAGGAACG 
CAGGCGGTCG ACTGGGAACA GCGCATCGAT ACCCAGCGCC TCCGCGAAGA GCGCAAAGCG
AGGGCGCTCG AACGCCTCCA GGAGACCAAC CTCGGGGCCA TGCTCCTCGT CTCGGATCCG
AACATCCGCT ACGTGACCGG GCTGGCGATG ACCGGTGGCA GCGGCGCGGA CCACTACACC
CTCCTTACCG AAAACGGCGA CATCGTTCAC TGGGACACCG CGGACCACGC GAGCAACCAG
CGGTTCAACT GCCCGTGGCT TCACGACATC CGTTATGCCT GTCCGGGGCT CGGCAACGTT
CCGCGAGCCT CTGGCAGCGC CTCGGCCCGC CAGTTCCTGC GATCGAAGAT GGCCGAAACC
GTTCACGACG CGATGGAAGA GTACGGCGTC GCCGACGAGA CGCTCGGGAT CGATATCGGC
AATCAGAGTC TCGTCTCGGT GTTCGAGGAC CGCGGCGTTG ATGTCGATGT CGACACCGCA
CAAGCGGTGA TGGAGGACGC CCGGAAGATC AAAACCGAGG ACGAGATCGA GTGTTTACGG
ATGGTCGCCT CGATCTGTGA GGCTGGCTTT CAGACCATCA AGGACACCGC CAAGCCGGGG
ATGCGCGAGA CCGAGGTCTG GGGCGAAGCC GTCCGCGAAC TCTGGCGTCA CGGCGCGTTC
GTCGGCGGCG GCTACGTTAC GTCGGGGCCG AACACGTGGC CCAAACACCA GGCGAACACC
ACCGACCGGG CGATCCGCCC GGGCGACCTC GTCTACGCCG ACTTCTACAA CATCGGCTAC
CTCGGCTACC GGTCGTGTTA CTACCGCACC TTCTCGATCG GCCAGCCAAC GCAGGCACAG
CAGGACGCCT ACGAAAAAGC ACGGGACGAT CTGTACAACG TACTCGAGTG CATCGAGCCC
GGTGCGACGA CCGACGAGAT CTGCCAGGCG TTCCCGGACG AAGAAGGCGA GCACATGGAC
TGGTACGACG CCGACGAGTT CTGGGAGATG ACGACGAATC ACTGGGCCCA CGGTCTCGGG
CTCCAGCTCT ACGAAGTGCC GCTGATCTGG CGTGGCCTTT CACCGGACCA TCCGATCGAG
ATCGAGGAGG GGATGACGAT GGCCGTCGAG ACGATGCAGC CGGCGGATAG ACAGGGTGTC
CGCGTCGAAG AGATGGTCGT CGTTCGCGAG AACGGCGTCG AGATTCTGAG TCAGTGGCCG
GTCGAGGAGA TTACGGTTAT CGACCACTGA
 
Protein sequence
MRYLSMSFHD RQFMAGTRGT QAVDWEQRID TQRLREERKA RALERLQETN LGAMLLVSDP 
NIRYVTGLAM TGGSGADHYT LLTENGDIVH WDTADHASNQ RFNCPWLHDI RYACPGLGNV
PRASGSASAR QFLRSKMAET VHDAMEEYGV ADETLGIDIG NQSLVSVFED RGVDVDVDTA
QAVMEDARKI KTEDEIECLR MVASICEAGF QTIKDTAKPG MRETEVWGEA VRELWRHGAF
VGGGYVTSGP NTWPKHQANT TDRAIRPGDL VYADFYNIGY LGYRSCYYRT FSIGQPTQAQ
QDAYEKARDD LYNVLECIEP GATTDEICQA FPDEEGEHMD WYDADEFWEM TTNHWAHGLG
LQLYEVPLIW RGLSPDHPIE IEEGMTMAVE TMQPADRQGV RVEEMVVVRE NGVEILSQWP
VEEITVIDH