Gene Nmag_3971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3971 
Symbol 
ID8828705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013924 
Strand
Start bp7934 
End bp9949 
Gene Length2016 bp 
Protein Length671 aa 
Translation table11 
GC content62% 
IMG OID 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003482069 
Protein GI289937467 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.330522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAATCAA CACAACTGTG GGGTATGCTC ACTCTCAACG ATGTTCTCGA TCTTGAATGG 
CCCACTGCCC CCAAGTGGTC GTCGTGCGGT TCGTTTCTCG CCGCAACGGT ACACGAGGAC
GATGGGAAGG TATTGCTCGT TGGTCACCCA GGAAAAGAGC CGTTGCGCGT CCGACCGGCC
GACGAACACG TCGCCGAGTT CACGTGGTCG CCAACGGAGC CACAACTCGT CTGTACGACC
GAGGATGGAT TGACTGCCCT CGTTGATCCA CGTGAGCAAT CCGTCTCGGA GATTGCGCGC
ACACCCGACG GCGACATCTT GCCCACCTGG TCGCACGATG GGTCGCGCCT CGCGTTCTAT
CGGGATGGCA AACCCCGAAC TAAGTCACTG ACCGACGGCG TCGAACGCGG GTTCGATGTT
CCGGAGCGGG GCCCATTTCT TGGTGGAGAA CGCGCGTTCG CCTGGCGTGG GGACGGGCTG
CTTGCCTACC GATTCACAGA CTGCGAGACG AAGTGCGTCG GCGTGATCAA CACCGAAAGC
GGGGACCTCG TCTGGCGAAC CCGCCCTGAT TCTTCCTCTC ATTCACCGTT CTGGCTCGAC
GATGGACGGC TGTGCTACGA GCGCCGAGGT GAGTACGGTA CCGTCCGCGA GTTTATCGCC
GTCGACATCG ATGATCCTGC GACCGAGAGT ACACTCTTCC AGGAGGTCGA CGAAGAGACC
GGCGCACTCT CGCGTGGGTC ACCACGTCTC TCCCCCGATG GAACACTGCT GGCCGCGGCG
CTCCCAGTCG ATGGCTACGA ACACATCCAC GTGATCGATG TCTCCTCAGG CGAGCGAACG
CAACTCACTG AGGGGGCCTT CGAGGACAAA GGACTCGCCG ATTCTACGCC GCGATGGATC
GACGATGAAC GATTAGTTTT CGCATCAAAC CGGCGAGACA CAGGACAGCG ACAGCTCTAC
ACGGTCACGC TCGATGGCAC GACAGAACCG CTGGTCGAAA CGGCGGGAAC AAACGTCGAA
CCGCGACCGT CGCCAGCCGG CGATCACGTC GCCTACCTCC ATGCAAGCCG CGACCGGTCT
CCGGAGGTTC GTGTCCGCGA ACTCGAGTCC GATTCGGCTG ATGCTGCCCC GACCGAACCA
GCGACGCAAA CCGACGCCAA GCTGACCCAC TCCGGTCTCC GCGACTGGCC GGAACCACCG
ATCGAACCGG AGCGAGTCTC GTTCGAGAGT GGTGAGCTGA CGATCGAGGG ATATCTCCTC
GACCCACGCC AAAGCGAGTC CGTGCCCGAC GACGCAACCG ACCTCCCGAG TGTCGTCTAC
GTCCACGGCG GGCCGATGCG ACAGATACGT GATGGCTTCC ATCCCTCCCG ATCGTACGGG
CTTGCGTACG CGTACCAGCA GTACCTCGCC ACGAACGGGT ACGTCGGTCT GTTCGTGAAC
TACCGTGGTG GCATCGGCTA CGGCCGGGCG TTCAGAGGAG CGATCGGCGG CGACCGAGGG
AGGGTCGAGA TGGACGACAT CGCTCGCGCC GCTGACTATC TGCGTGCTCT CGAGTACACC
GCTGATTCAG TAGGACAATG GGGACTCTCC TACGGCGGCT ACGCCGCGCT TCAGCTCCCC
GGAACCCACC CTGGAACATT TGATGTCACG GTCAACATCG CCGGACTCGC CGATACAGCG
AACTACCACG AGTGGGCGAC CGAGACGAAA TTCCCCGCCA TCGCCTCCGC AGCGACGACC
GTGATGGGCC ATCCACTCGA GAACCCCGAC CGGTGGGATG ACGCTAGCCC GGTCACCCAC
ATGGACCGAT ACGAGACGCC GGTGTACAAC TTCCACGGCA CGGCTGACCG GTACGTGAAC
GTCGAGCAGC AGGATATCGT GGTGAACACG CTGCTCGATC TCGATGTTGA GTTCGAGGCC
GAGTACTATC CTGACGAAGG GCATGTGTTC TCGAAGCGAT CGACGTGGCG GCGAACGTTC
GAGAAGATCG AAGCGGCGTT CGACGAGCAC CTCTAG
 
Protein sequence
MESTQLWGML TLNDVLDLEW PTAPKWSSCG SFLAATVHED DGKVLLVGHP GKEPLRVRPA 
DEHVAEFTWS PTEPQLVCTT EDGLTALVDP REQSVSEIAR TPDGDILPTW SHDGSRLAFY
RDGKPRTKSL TDGVERGFDV PERGPFLGGE RAFAWRGDGL LAYRFTDCET KCVGVINTES
GDLVWRTRPD SSSHSPFWLD DGRLCYERRG EYGTVREFIA VDIDDPATES TLFQEVDEET
GALSRGSPRL SPDGTLLAAA LPVDGYEHIH VIDVSSGERT QLTEGAFEDK GLADSTPRWI
DDERLVFASN RRDTGQRQLY TVTLDGTTEP LVETAGTNVE PRPSPAGDHV AYLHASRDRS
PEVRVRELES DSADAAPTEP ATQTDAKLTH SGLRDWPEPP IEPERVSFES GELTIEGYLL
DPRQSESVPD DATDLPSVVY VHGGPMRQIR DGFHPSRSYG LAYAYQQYLA TNGYVGLFVN
YRGGIGYGRA FRGAIGGDRG RVEMDDIARA ADYLRALEYT ADSVGQWGLS YGGYAALQLP
GTHPGTFDVT VNIAGLADTA NYHEWATETK FPAIASAATT VMGHPLENPD RWDDASPVTH
MDRYETPVYN FHGTADRYVN VEQQDIVVNT LLDLDVEFEA EYYPDEGHVF SKRSTWRRTF
EKIEAAFDEH L