Gene Nmag_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0073 
Symbol 
ID8822892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp89298 
End bp90506 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content64% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003478234 
Protein GI289579768 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGACA ACACCACACG TCGAACGGTA CTGGCTGGAA CAGCGGCGGG GCTCGGACTA 
CTCACGGTCG GAACGGTAGC GGCGGACGAA GAACAGGTGC GGTACGTCGT CACCGGTGGG
AGCCCACAGC AACTCGAGCG CGAGGGGTTC ACGGTCGTCC ACGAAATCTC GGACGCGAAC
GTCTATCTCA TCCTCGGTCC CGAGGATGCA GATCCGACGA GCGTCGACGG CGTCAACAGC
GCTTTCGAGG ACTTCACGTA CGAACTCGAC GTTGCCGACG AGCCAACCGA GGCGCCGACG
GGTGAGCTCG AAGACGAACA GTGGGACAAA GACCTGATCA ACGCGTTCGA CGCGCACGAT
CACGCGACGG GTGCGGACAC TCGTATCGGC ATCATCGACA CCGGCGTCCA CGACGGCCAT
CCCGATCTGG GGAACGTCGA TGTCGACGCC AGTCGGACGT TCATCGACTG GGAGGAGTCG
GACCACACGG GCGACGTGCA GTACCACGGC ACCCACGTCG CCGGCATCGC TGCAGCGACT
GGTTCCGAAG GCGTGACCGG CGTTGCGCCG GATGCAGAGA TTGTCTCACT CCGTGTCTTC
CCGTCGGAAG GACCCCTCCT CGCCAGCGTT GGAGACACCC TCCTGGCGCT CGATCATGCC
GCAGAACAGG GCCTCGATGT CGTGAATATG AGCATCGGCA GTGCGCCACA GCCGCCTGAA
GAGAATCAGG AGGGGTATCG CATCGCTCGC CAGCAGGTCG TCCGAAGCGT CACCCAGCGG
GGAACGTCCG TCGTCGTGAG CGCTGGCAAC GACAGCCAGA ACCTCCAGCA GGGCGGCTGG
ATCAGTCTCT GGGGAAGTAT CCCCCGTGCG TCGTCCATCA GCGCGACCAC GGAAGCCGAC
GAACTCGCAG ACTTTTCGAA CTACGGCGCG AACGAAATCT CGGTCGGTGC GCCTGGTGAC
ATGATCCTCT CGACGTTCGA CCCGGACAAC GAGGCGCTTC CCGGTACCGA GTACGCGAGC
GCATCCGGGA CCTCCATGTC CGCCCCACAG GTCGCCGGCC TCGTCGGCCT CGTCCGCGAA
CTCGATCCGA ACGCCAACGT AAATCAGGTC GAAAACGCGA TCGAACGCGG TGCGACGGGC
GACGGCCGCG ACGATCCTGA GACCGGTGCC GGTCGCATCG ACGCACTCGA GACGGTCGAC
CTCCTGTAA
 
Protein sequence
MADNTTRRTV LAGTAAGLGL LTVGTVAADE EQVRYVVTGG SPQQLEREGF TVVHEISDAN 
VYLILGPEDA DPTSVDGVNS AFEDFTYELD VADEPTEAPT GELEDEQWDK DLINAFDAHD
HATGADTRIG IIDTGVHDGH PDLGNVDVDA SRTFIDWEES DHTGDVQYHG THVAGIAAAT
GSEGVTGVAP DAEIVSLRVF PSEGPLLASV GDTLLALDHA AEQGLDVVNM SIGSAPQPPE
ENQEGYRIAR QQVVRSVTQR GTSVVVSAGN DSQNLQQGGW ISLWGSIPRA SSISATTEAD
ELADFSNYGA NEISVGAPGD MILSTFDPDN EALPGTEYAS ASGTSMSAPQ VAGLVGLVRE
LDPNANVNQV ENAIERGATG DGRDDPETGA GRIDALETVD LL