Gene Nmag_3646 details

Gene Information

Locus tag: Nmag_3646
Symbol:
ID: 8826514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013923
Strand:
Start bp: 29081
End bp: 30226
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 65%
IMG OID:
Product: peptidase M42 family protein
Protein accession: YP_003481756
Protein GI: 289583346
COG category:
COG ID:
TIGRFAM ID:
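
The start, end, gene length and protein length listed above are consistent with one another. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in one untranslated stop codon (neither assumption is stated in the record itself):

```python
# Values copied from the Gene Information fields above.
start_bp = 29081
end_bp = 30226

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1146 381
```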


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0943449
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCTCA AACGGACGGA ACTCGACCGA CTCGTCGCTG CTCGCGGCGG TCCCGGTGGC 
GAGTATCACG TCGCCCGTGT TTTCGAGGAA TTGATCGAGC CGTACGTCGA CGAGGTGTCG
TGGGATTCGA TGGGAAACGT CGTCGCGACG AGTTACGGTG AGGACGACTC CAGTGCTGAC
GATACCGACG ATAGCGGTTC GAACGGGACC GACACTGGTA CCGGCATCGA CAACGACACA
GACACCAAAG ACGTCCTCCT CGCCGCCCAC ACTGACGAAC TCGCGTTTCT GATCGACGAT
ATCACTGAGG ACGGCCTCTG CTCGTTCTCG ATGCTTGGCG GCCACTACCG GGGCTATCTC
CCCGGGCAAC ACGTGCTCGT CGGCCCCGAC AAGGTTCCCG GCGTCGTCGG GACGAAGCCG
CGACACTTCA TGGACGGCGA CGAGAAAGGC AGTCTCCCTG AAACCCTGCA CATCGACCTC
GGTGCCCGAA GTCAGGAGGA AGTGGCCGAA CTAAACGTCG AACCCGGCGA CCACGCAACC
TGGGACCGCG AACTAACCGA CCTCGCAAAC GGCCGACTCG CGGGCCGAGC GCTCGACGAC
CGCATCGCAC TCGCAATCCT CGTCGCCGTC GCCCGCGAGA CCGACTCGGA TCGAACCGTC
CACTACGCCG CCACCGTCCA GGAAGAAGTC GGCCTCCGCG GTGCCCGTGC TGCAGTTCAC
GAGGTGGATC CTGACATCGC CATCGCACTC GAGATCTTCC CGAGCGACGA CTACCCGATC
GACGGCGACC GATCGAGTAC CGTCGAACTC GGCGCTGGTC CGGTAGTGGA GTTCGGCGAC
GGCACCTCCG AGTACCTCTT CGGTGGCGTC CTCGTCGATC GACAGACACT CGAGTGGCTC
ACAGCCGCCG GGTCGTCGGC CGACGTGACC CTCCAGCACG ACGTCATGAT CGGGGGCACG
ACCGACGCGA CGGAGTTCCA GAGTGCCGGC CGGGCGCGCC ACGCTGGCGC GATTGCTGTC
CCCTGTCGGT ACACGCACTC ACCTGTCGAG ACGATCGACC TCGACGACGC CGAGGAGACG
GTCGATGTGC TCGTCGCTGC GCTGGAATCG CCGTTCCCGG GCCGGACTGA CGTGCGCGGG
CGTTAG
 
Protein sequence
MALKRTELDR LVAARGGPGG EYHVARVFEE LIEPYVDEVS WDSMGNVVAT SYGEDDSSAD 
DTDDSGSNGT DTGTGIDNDT DTKDVLLAAH TDELAFLIDD ITEDGLCSFS MLGGHYRGYL
PGQHVLVGPD KVPGVVGTKP RHFMDGDEKG SLPETLHIDL GARSQEEVAE LNVEPGDHAT
WDRELTDLAN GRLAGRALDD RIALAILVAV ARETDSDRTV HYAATVQEEV GLRGARAAVH
EVDPDIAIAL EIFPSDDYPI DGDRSSTVEL GAGPVVEFGD GTSEYLFGGV LVDRQTLEWL
TAAGSSADVT LQHDVMIGGT TDATEFQSAG RARHAGAIAV PCRYTHSPVE TIDLDDAEET
VDVLVAALES PFPGRTDVRG R
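
The sequences above are laid out in space-separated blocks of ten characters, which most tools need removed before use. The sketch below shows one way to clean such a listing and compute a GC percentage like the one reported in the gene information. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demonstration fragment is just the first and last lines of the gene sequence joined together, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Demonstration only: first and last lines of the gene sequence listing.
fragment = """
ATGGCGCTCA AACGGACGGA ACTCGACCGA CTCGTCGCTG CTCGCGGCGG TCCCGGTGGC
CGTTAG
"""

seq = clean_sequence(fragment)
print(len(seq), seq[:3], seq[-3:])  # prints: 66 ATG TAG
print(round(gc_percent(seq), 1))    # GC percentage of the fragment only
```

Pasting the complete gene sequence into `fragment` would give the GC content of the whole gene, which can then be compared with the rounded value in the record.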