Gene Nmag_1722 details

Gene Information

Locus tag: Nmag_1722
Symbol:
ID: 8824562
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 1756778
End bp: 1759066
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: peptidase S9 prolyl oligopeptidase active site domain protein
Protein accession: YP_003479860
Protein GI: 289581394
COG category:
COG ID:
TIGRFAM ID:
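
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 1756778
end_bp = 1759066


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bp):
    """Residue count for a CDS of n_bp bases, assuming one trailing stop codon."""
    if n_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under those assumptions the computed values agree with the Gene Length (2289 bp) and Protein Length (762 aa) fields.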


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACGA TCGAGGCGAC CGACTTCTAC GACCTGCGGC AGGTTTCGGA TCCACAGCTC 
TCGCCTGGCG GCGAGCGCGT CGCATACGTC GAACAACTCC CCGAAGACGA GGAATCCTCC
GAGGCAACCA TCCACGTCGT CCCCGTCGGC GGTGACGAGC CAACGCAACT CACGATTAGC
GAGGGTGTCG ACAGCCAGCC CCGCTGGAGC CCCGACGGCG ACCGCCTCGC GTTCGCCAGC
ACCCGCGGCG AGGACGACGA CCGCCAGCAA CTCTGGATAC TCCCCACGAC GACCGGCGGT
GAAGCCCGCC AGCTCACGTC CGTCGTCGGC GGCGTCACCG GACTCGAGTG GAGTCCCGAC
GGCAGCCGTC TCCTGTTCAC CCAGCAGGTC ACCGCCGACG ACCGCGAGGA GGGGCGCGAC
CTCGCCGTCG ACCCTGAGTA CGAACCCGAG ACGCCCGACC CGCGCGTTAT CGACCGCATG
ATCTACCGCG CCGGTACGGA GTACATGGAC GGCCGGCGGA GTCACGTCTA CGTCCTCGAT
ATCGAGGCTG CACTCGAGTC CGATCCATCC GATCCGACTG ATACTGATTC CGAGGATACG
GACGCGATCG AACGCCTCAC TGACGGCGAC GAAGACCATA TCGGAGCGAC CTGGGGTGAC
GACGAGACGG TCTACTACGC GGTCAAAACC GCCGAGGATG CGGTCGAGGC AGACGACTCG
AGCCGGTACG ACCTGTACGA ACACGACCTC GAAACGGATG AGGCAACGGC CTTCACGCAG
ACGACCGGCT GGGTCACCGA ACTCGAGGCG ACGACAGATG GGCGGATCGC GTACGCATTC
ACGCCTGAAG AACAGGCCTC GCTACGCCAG ACTGACGTTC GCGTGCACGA CCGCGAGACT
GGCGCGGAGA CGACGCCGAC GGCGTCGCTG GATCGGACGA TCGACGGCAA CTTCACCTGG
GCGCCGGACG AGGAGCTGCT GTACTTCACG ACGCCGGATG AGGGGGCGAA CGTGCTCTGG
TCGGTCAACG TGCCGACGAC GATCGACAGC GATGAGGAAG CACTCGAGGA GCCGACCCGC
GTTTACGGCG ATGACGTGAC TGTGTCGGGC TTCTCGGTCG GCGACAACGC CGTCGCGCTC
GTCCAGAGCG AGTGGGACCA CCCTGGCGAC ATCTTCGTGA CGACCCGCGG CGGAAACGAG
ACGCACCGAC TGACGCGCGT CAACGGCGAC CTCCTCGCGG ATCGAGCGGT GCGCCAGCCG
GAGGAGATCT GGTTCGAGAG GGGTGACGCC GGGATTGAAG ATGCCGACGG CAACGGCAAC
AGCGACGGCG ACGGCAACAG CGACGACGGC GAACGCAACC AGATCCAGGG CTGGCTGCTA
ACCCCACCTG AATTCGACGC CGACGCCGCG AGCGGCCCGG ACGAAACCTA CCCGCTCGTC
GTCGAAATCC ACGGCGGTCC CCACTCCCAG TGGACCACCG CGGGGACGAT GTGGCACGAG
TTCCAGACGC TCGCGGCACA GGGCTACGTC GTCTTCTGGT CTAATCCCCG CGGCTCGACG
GGCTACGGCG AGGACCACGC CATGGCGATC GAACGCAACT GGGGCGATGT GACGCTTGCG
GACGTACTCG CGGGCGTCGA CGAAGTCTGT GAACGCGATT TCGTCGACGA GGACGAACTG
TTCGTCACCG GCGGGAGCTT CGGCGGATTC ATGACCTCGT GGGCGGTCAC CCAGACGGAC
CGCTTCACGG CCGCAGTCTC CCAGCGCGGC GTCTACGATC TCACCAGTTT CTACGGCTCG
ACGGACGCGT TCAAGCTCAT CGAGGGCGAC TTCGACACGA CACCCTGGGA GGAGCCCGAA
TTCCTCTGGG AGCAGTCGCC CGTTGCACAC ATCCCGAACG TCGAGACGCC AACGCTCGTG
CTCCACTCTG ATCGGGACTA CCGGACGCCT GCGAACACCG CCGAACTGTT CTACCTCGGT
CTGAAGAAAC ACGGCGTCGA CACCCGTCTC GTCCGGTATC CGCGCGAGGG CCACGAACTC
TCGCGTTCGG GTGAGCCGGG CCACATCGTC GACCGACTCG AGCGCATCGT CCGCTGGTTC
GACGGCTACG CCGACTCCCG TGAGGTCCCA CCGGCACTCG AGCGCGACCC GAACGCGGGG
CTGTCAGGAG GACTGGACGA GTCAGATGAT GGGGAGGGCG AGAGCGCGAG TGAGAACGCA
AACACAAACG CGAACGCGAA CGTAAACGCG AACGCGAACG GGAACGAGGC GACGAGCACA
GACGACTGA
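
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs cleaning before any computation such as GC content. The following is a minimal sketch, assuming the sequence has been pasted in as plain text; the helper names are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(text):
    """Remove all whitespace (block spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as displayed above.
first_row = "ATGAACACGA TCGAGGCGAC CGACTTCTAC GACCTGCGGC AGGTTTCGGA TCCACAGCTC"
seq = clean_sequence(first_row)
print(len(seq), "bases; GC fraction", round(gc_fraction(seq), 2))
```

Passing the full sequence text to the same two functions should give a value comparable to the GC content field reported in the Gene Information section.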
 
Protein sequence
MNTIEATDFY DLRQVSDPQL SPGGERVAYV EQLPEDEESS EATIHVVPVG GDEPTQLTIS 
EGVDSQPRWS PDGDRLAFAS TRGEDDDRQQ LWILPTTTGG EARQLTSVVG GVTGLEWSPD
GSRLLFTQQV TADDREEGRD LAVDPEYEPE TPDPRVIDRM IYRAGTEYMD GRRSHVYVLD
IEAALESDPS DPTDTDSEDT DAIERLTDGD EDHIGATWGD DETVYYAVKT AEDAVEADDS
SRYDLYEHDL ETDEATAFTQ TTGWVTELEA TTDGRIAYAF TPEEQASLRQ TDVRVHDRET
GAETTPTASL DRTIDGNFTW APDEELLYFT TPDEGANVLW SVNVPTTIDS DEEALEEPTR
VYGDDVTVSG FSVGDNAVAL VQSEWDHPGD IFVTTRGGNE THRLTRVNGD LLADRAVRQP
EEIWFERGDA GIEDADGNGN SDGDGNSDDG ERNQIQGWLL TPPEFDADAA SGPDETYPLV
VEIHGGPHSQ WTTAGTMWHE FQTLAAQGYV VFWSNPRGST GYGEDHAMAI ERNWGDVTLA
DVLAGVDEVC ERDFVDEDEL FVTGGSFGGF MTSWAVTQTD RFTAAVSQRG VYDLTSFYGS
TDAFKLIEGD FDTTPWEEPE FLWEQSPVAH IPNVETPTLV LHSDRDYRTP ANTAELFYLG
LKKHGVDTRL VRYPREGHEL SRSGEPGHIV DRLERIVRWF DGYADSREVP PALERDPNAG
LSGGLDESDD GEGESASENA NTNANANVNA NANGNEATST DD