Gene Nmag_3662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3662 
Symbol 
ID8826530 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp49363 
End bp51228 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content63% 
IMG OID 
Productpeptidase S9 prolyl oligopeptidase active site domain protein 
Protein accessionYP_003481772 
Protein GI289583362 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0540328 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACGTG ATGTCGGGCG CTACTTGAAC GCTCGCGGAA CAAGAGGAGC GACGGTCGCA 
CCGGACGGTC GCCTGGCGTT TCGCGCGGAC ACGACGGGAA CGATGCAGGT CTGGACACTC
GGCGAGCCGG GCGGTTGGCC GACCCAGCAG ACGTTCTACG ACGAGCGCGT TTCGTTCGTC
GACTGGTCAC CGAGCGGCGA CGTGATCGCG TTCGGTAAGG ACACCGGCGC CGACGAGCAC
GATCAGTTGT ACCGACTCAG TCCCGACGAC CGGACGATCG TCCAACTGAC GGACAGACCA
GATGCGATCC ACCAGTGGGG CGCCTGGAGC CCCGATGGAG ACCGAGTCGC GTTCACCGCC
AACCGTCGTA CCACCGCTGA TTTCGACGTG TACGTAATGG CTGTCGATAC CGGCGAGGGC
GGGACCGATC ACGAGGGTGG AGCCGACGGG AGCGAACCGA CACGCGTCTG TGAGGGCGAC
GGTCGCATCA GCGTCGTCGG CTGGAGTCCG TCCGGTGACC GACTCCTTCT CAGAAACGCC
CACGCCAGTT CGGACATCGA CCTCTCCGTC GTCGAGGTCG ACAGTGGCGA ACGTCGACAC
GTCACCCCAC ACGAGGACCC TGCTCGATAC AGCCACCCCA CCTTCGGACC GGACGGCGAA
GCAATCTACT GTGTGAGCGA TACCTTCGGC GACACGACGG AACTGATCCG CATCGACCTT
GCAACGCTCG AGGCGGAACC AGTCACCGTC GACGATCAGG AATCTGATGA CTGGAGTGTC
GACGAGTTCG GCCTCGACGC TGCGACCGGT CGGCTCGCAT ACGCGCGAAA CGTCGACGGC
TACTCCGAGC TCTTCGTCGG TCAGCTTCGG ACACCGACCG AGGCAGACAC CGTCTCAGTC
GGGGTCCCAG ACGGTGTCGT CTCTGGGCTG ACGATCGGCC CTGCAGGAAA CCGAGCGGCA
ATGACCGTCT CGTCGACCAA CCTGAACTAC TCGATCTACA CCGTCGAACT GGACACGATC
GATGACACTG CTACACCGGA GTCGGAACGC TGGACGATAC CCTCCTCCGG TGGTATCCCC
CTCGAACAGT ACCACGAACC CGATCTGATC CGATACGAGA CGTTCGACGA CAGGGAGATT
CCGGCGTTCT TCACCCTTCC CGACGACTAC GAGGAGGGAG AGACGCCAGT GATCGTAGAC
ATTCACGGCG GCCCACACTC TCAGCGGCGT CCCTCGTGGC GGAACCGGCC GATCCGCCAG
TACTTCCTCG ATGCAGGATA CGCCCTTTTC GAACCGAACG TCCGTGGCTC CTCGGGCTAC
GGGAGCGAGT ACGCCGCGCT CGACGACGTC GAGAAACGGA TGGATTCCGT TCGAGATATT
GCGGAAGGCG TCGAGTGGCT CCGCGACCGG CCGGAGATCG ACGCCGACTC GATCGTCTGC
TACGGCCGCT CTTACGGCGG CTTCATGGTG CTTGCGTGTA TCACCGAGTA TCCCGACATC
TGGGCGGCCG CCGTCGATTT CGTCGGTATC TCGAACTGGG TGACGTTCCT CGAGAACACC
GGAGACTACC GCCGACCCCA CCGCGAGGCA GAGTACGGCT CACTCGATGC TGATCGCGAG
TTCCTCGAGT CGATCAGTCC AATCCACACG GTCGATCAGA TTGCGTGCCC GCTGTTCGTC
CAGCATGGCG CGAACGATCC GCGCGTGCCG GTCGATGAGG CACGCCAGAT CGCTGACGCC
GTCGAAGAGC AGGGCGTTCC GGTCGAAACG TGTATCTTCG AGGACGAAGG TCACCACACG
ACCAAGCTCG AGAACCGTAT CGAACAGTTC GAGCGAATCG ATGCGTTCCT CGATGAACAC
GTGTAA
 
Protein sequence
MTRDVGRYLN ARGTRGATVA PDGRLAFRAD TTGTMQVWTL GEPGGWPTQQ TFYDERVSFV 
DWSPSGDVIA FGKDTGADEH DQLYRLSPDD RTIVQLTDRP DAIHQWGAWS PDGDRVAFTA
NRRTTADFDV YVMAVDTGEG GTDHEGGADG SEPTRVCEGD GRISVVGWSP SGDRLLLRNA
HASSDIDLSV VEVDSGERRH VTPHEDPARY SHPTFGPDGE AIYCVSDTFG DTTELIRIDL
ATLEAEPVTV DDQESDDWSV DEFGLDAATG RLAYARNVDG YSELFVGQLR TPTEADTVSV
GVPDGVVSGL TIGPAGNRAA MTVSSTNLNY SIYTVELDTI DDTATPESER WTIPSSGGIP
LEQYHEPDLI RYETFDDREI PAFFTLPDDY EEGETPVIVD IHGGPHSQRR PSWRNRPIRQ
YFLDAGYALF EPNVRGSSGY GSEYAALDDV EKRMDSVRDI AEGVEWLRDR PEIDADSIVC
YGRSYGGFMV LACITEYPDI WAAAVDFVGI SNWVTFLENT GDYRRPHREA EYGSLDADRE
FLESISPIHT VDQIACPLFV QHGANDPRVP VDEARQIADA VEEQGVPVET CIFEDEGHHT
TKLENRIEQF ERIDAFLDEH V