Gene Nmag_3997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3997 
Symbol 
ID8828731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013924 
Strand
Start bp38508 
End bp39878 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content56% 
IMG OID 
Productpeptidase M28 
Protein accessionYP_003482092 
Protein GI289937490 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.664765 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCAGTT TACCCAACCA GATCGTTGGC GATGCATACA CGAGTACGCA CGGCTGGCAA 
CTTGTCGAAG CACTTGCTGA CCTCCGCGAC CGGATGCCAG GCAGCGAGGG GGAACGAGCT
GGTGCCGATC TAGTGGCGGA ACAATTCGAG GAGATCGGAC TCGATAACGT CTCAAGCACG
GAATTTCAGA TTCCGGGCTG GGAGCGAAAC TCCGCTTCTG TTACTGTTGA CGACTACGAC
CTGTTCAAGA GATCCCACGA AGTGGTCGCA CTTCCTGGAA CTCCAGCAGA AACGACGTCC
GCCGAACTCA TTGATATGGG GCATGCCCTT CCCGAAGATT TCGAAGACGT TGATCTGGAC
GGGAAGATCG TGATGGCCTC AAGCCTTACA CCCGACGATT ACGGACGGTG GGTTCACCGA
GGTGAAAAAT ACTCCTATGC CATCGAAGCT GGCGCAGCCG GGTTTATTTT CGTCAACCAT
ATTGAGGGCT GTCTTCCGCC GACGGGCAGT ATTGGTGATC GGAACGGACC TGGGGCAATC
CCAGCGGTCG GCGTCTCGAA GGAAGTCGGG GACAGGATCA AGCGATTCTG CAGAGACAAT
ACAACCGAGG CAACGATCGC AGTCGATTGT CAAAATACCG AAGCAACCTC TCGAAATATC
GAGGCCACCG TCGGTCCGGA CACTGAAGAA GAGGTGCTCT TTACCGCCCA TGTTGACGCC
CACGATGTCG GTGACGGGGC GAACGACAAT GGCGTCGGAT GTGCGCTCGT GACTGAGGTC
GGGCGACTCC TCAAACAGAT CGAAGACGAT CTCGAAACTC GCGTTCGATT AGTGACGTTC
GGGGCCGAAG AAACCGGATT ATACGGCGCC TATTACTGGA CGCATACACA CGACCTCGAC
CAGGTCAAAT GCGTTCTCAA TATGGACGGT GCGGGATACT CGCGAAACCT CTCGATCCAT
ACCCATGGCT TCGACGCAAT CGGTGAAGCG TTCGAAGAAG TGAGCGAGGA ATTCGGCGTA
CCGATTGATG TGGAATCCGG GATCCGCCCA CATAGCGATC ATTGGCCGTT CGTACAACGT
GGCATCCCTG GAGCACAGGG GCGAACAACT GCCGAAGATA GCGGACGCGG ATGGGGACAC
ACCCATGGCG ATACCCTCGA CAAACTAGAT ATTCGTGATC TCCGCGAGAT ATCGACGCTT
CTGACAGCTG GCGTCCTCAA ACTCTCAGAG ACCAGTCGAG AAATTGAACG GGTCGACGAC
GTTGAAATAC GCGATGCAAC GGAAGAGCAA CGGTTTGATG TCGGAATGAA AGCGACCGGT
AGCTGGCCGT GGGGCGATGA GGTTCGCGTC TGGCCCTGGG ACGATGCCTG A
 
Protein sequence
MVSLPNQIVG DAYTSTHGWQ LVEALADLRD RMPGSEGERA GADLVAEQFE EIGLDNVSST 
EFQIPGWERN SASVTVDDYD LFKRSHEVVA LPGTPAETTS AELIDMGHAL PEDFEDVDLD
GKIVMASSLT PDDYGRWVHR GEKYSYAIEA GAAGFIFVNH IEGCLPPTGS IGDRNGPGAI
PAVGVSKEVG DRIKRFCRDN TTEATIAVDC QNTEATSRNI EATVGPDTEE EVLFTAHVDA
HDVGDGANDN GVGCALVTEV GRLLKQIEDD LETRVRLVTF GAEETGLYGA YYWTHTHDLD
QVKCVLNMDG AGYSRNLSIH THGFDAIGEA FEEVSEEFGV PIDVESGIRP HSDHWPFVQR
GIPGAQGRTT AEDSGRGWGH THGDTLDKLD IRDLREISTL LTAGVLKLSE TSREIERVDD
VEIRDATEEQ RFDVGMKATG SWPWGDEVRV WPWDDA