Gene Nmag_1249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1249 
Symbol 
ID8824081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1274834 
End bp1276129 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content60% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003479393 
Protein GI289580927 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.613624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGATG AAATGACGCA CAACAGTACC GGTCGACGGA ACGTGCTCAA AACGGTTAGC 
GGTGCGGGGA TCGCTCTGGG CGGACTCGCG AGCGTGTCGA GCGTGCAGGC TGACTCCCAC
GGTGAAGCGG GTCGGTTTAA CGCTGGGTAC GCGACCGAGA CGGGAGAACA GCAAATCAGG
GACGCGGCGG ACAACGTCAT CCACGCGATT CACTCGGTGA ACACCCTCAC TGTCGAGGCA
GCGGTTGAAG ACATGGAGGC GCTCACGGAG TCCGATCACA TCGACTTCGT TGCACCGGAT
CAGGAGTACT ACGCGGGTTC GGATACGAAC GACGACAACG ACAACGCTGG GGGACAAGTC
GTTCCCTGGG GCGTCGAACG AATCGGTGCA CGGAAAGCCC ACAAAGCGAG CAAGCGCGGA
AAGGGAGCGA ACGTCGCAGT GATCAACACC GGGATCGACC CGACCCATCC GGACCTCGCC
GAGAATATCG GTGAGGGTAT CGCGTACAAC CGTGCGGTTG GCTATCCCGA GATCAACTGG
ATCGACGACA ACGGCCACGG CACCCACATT GCCGGAACGA TCGCTGCAGC CGACAACGAC
TTCGGTGTCG TCGGTGTCGC GCCGGAGTCG ACGCTCCACG CCGTGAAAGT GTTGAACCAG
GAGGGTGTGG GCTATGCATC CGACGTCGCG ATGGGGATCA TCTGGTCCGC AATCAAGGGC
TGTGATGTGG CCAACATGAG TCTGAGCGGT CCCTACTCGC CGCTCGTACA GCGGGCAATC
CACTTTGCCC ACCAGCGGGG CCTCCTGATG GTTTCCTCGG CTGGCAATAG CGGTGAGGCC
GTTTCGTACC CAGCATCCGC AGAGGAAGTT GTCGCAGTGA GTTCGACGAC ACAAGACGAT
GGCTTCGCGG AGTTCTCGAA CTACGGTCCC GAGATCGAGC TGGCTGCACC GGGTGTCGAT
ATCCTCTCAA CGATACCAGG CGGCGAGTAC GGTGTTGCAA CCGGGTCCTC GTTTGGAACG
CCGCACGTCA CTGGAACGGC AGCGCTGCTG ATGGCGAAGG GCTACTCTGC AAAGGAAGCC
CGGTACTTCA TGACCGACAC AGCGATCGAC CTCGGTCTGC CGTCCAAAAA GCAGGGTTCC
GGGCTGGTTA ACGCCGCAGC GGTGGCCAAA ATCAAGAAAG GGAAGAAAGG GAAGAAAGGC
AAGAAGGGCA AGAAGGGCAA GAAAGACGCC CACAAGAAGG ACAAGGACTA CGGGAAGGAC
AAGAAGGACA AAAAGGACAA AGCGAACACG AAGTAA
 
Protein sequence
MSDEMTHNST GRRNVLKTVS GAGIALGGLA SVSSVQADSH GEAGRFNAGY ATETGEQQIR 
DAADNVIHAI HSVNTLTVEA AVEDMEALTE SDHIDFVAPD QEYYAGSDTN DDNDNAGGQV
VPWGVERIGA RKAHKASKRG KGANVAVINT GIDPTHPDLA ENIGEGIAYN RAVGYPEINW
IDDNGHGTHI AGTIAAADND FGVVGVAPES TLHAVKVLNQ EGVGYASDVA MGIIWSAIKG
CDVANMSLSG PYSPLVQRAI HFAHQRGLLM VSSAGNSGEA VSYPASAEEV VAVSSTTQDD
GFAEFSNYGP EIELAAPGVD ILSTIPGGEY GVATGSSFGT PHVTGTAALL MAKGYSAKEA
RYFMTDTAID LGLPSKKQGS GLVNAAAVAK IKKGKKGKKG KKGKKGKKDA HKKDKDYGKD
KKDKKDKANT K