Gene Nmag_3633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3633 
Symbol 
ID8826501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp17694 
End bp18938 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content64% 
IMG OID 
Productpeptidase S8 and S53 subtilisin kexin sedolisin 
Protein accessionYP_003481744 
Protein GI289583334 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.183359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACACC AACGACAGAC ACGCCGGCGG GTACTACGGG GAATCACAGC AACGGGCCTG 
ACGCTCGGGA TGATCGGGAC TGGCAGCGCA CAGTCGTCGT CGACGTACGT CGTCACCGGG
GGTGGGCGGA CGCGCCTCGA GAACGCAGGC GCGACGATCA GGCGAGAACT CGCGAACGGA
TCCGTCTTCA TCGTCTCGGC CGAGGATGGC GCTGCGGACG ACCTGCGCTC CGTTTCGGGC
GTCAGTGGGG TCACGGAGAA CTTCGAGGTC GAACACGATG GGCCGATTTC AGAGGTGGAG
CCACAGACGA CCGACGACGC CGAATTCACC GAGAAGCAGT GGGACAAGGA GATTACGGAT
ACGTTCGAAG CCCACGACTA CGCAACCGGT GAGGGGACGC GAATCGTCAT CGCAGATACC
GGCGTCGACG GCACGCATCC GGATCTGGAA GCGAACTTTA ACGAGGAGTT GAGCGTCTCG
TTCGTCGACG GCGGCGAAAA AGATGAACAC ATCGGCGACT CCGGCGACCA CGGCACCCAC
GTTGCTGGCA CCGCGGCCGC AACCGGTGCA GAAGGGATTA CCGGAACCGC ACCTGACGCC
GAACTCGTCT CCGTTCGTGT CCTCGGTCCA GATAGTAGCT CCTTCGCGGA CATCCTCGCC
GCAGCCGACT ACACCGCCGA GATCGGTGCA GACGTCGCGA ACTACAGCCT CGGTGCGGGT
CCGTTCCCAC CCGAGGCCAA CAGCGACGGT ACTCGAGTCG CCGTCCAGAA GGTGATGCAA
GATGTCGCCC GTCGTGGGAC GGTGTCGACA GTCTCTGCAG GCAATGCCGA GACCGATCTT
CAGCGGGGTG GCCTGTTCTA TCTGCCGGGG ACCGTCCAGG GAGTGATGAC GGTTTCGGCG
AGCGGTCCGG GGGACAACCT TTCGTTCTAC TCGAACTACG GGACGAGTGA GATCGAGGTC
GGCGCACCCG GTGGTGGCCG GGGGACACTC GAGGAAACTG TCACCCCCGA CGATCTCGTC
TTCTCGACCG AACCAGACGG GACCTACGGC TGGAAGGCCG GCACGTCGAT GGCTGCCCCG
CAGGTTGCGG GACTCGTTGG ACTCGTGCGT GAACTCGAGC CTGATGCACA CGCGAACCAG
GTCGAGAACG CGATCGCACA CGGTGCGGAA CTCGTTCCGG GACGCAGCAG CCCCGAGTTC
GGCGCTGGTC GAATCAACGC GCTGAACACC GTCAGCAACC TGTAG
 
Protein sequence
MAHQRQTRRR VLRGITATGL TLGMIGTGSA QSSSTYVVTG GGRTRLENAG ATIRRELANG 
SVFIVSAEDG AADDLRSVSG VSGVTENFEV EHDGPISEVE PQTTDDAEFT EKQWDKEITD
TFEAHDYATG EGTRIVIADT GVDGTHPDLE ANFNEELSVS FVDGGEKDEH IGDSGDHGTH
VAGTAAATGA EGITGTAPDA ELVSVRVLGP DSSSFADILA AADYTAEIGA DVANYSLGAG
PFPPEANSDG TRVAVQKVMQ DVARRGTVST VSAGNAETDL QRGGLFYLPG TVQGVMTVSA
SGPGDNLSFY SNYGTSEIEV GAPGGGRGTL EETVTPDDLV FSTEPDGTYG WKAGTSMAAP
QVAGLVGLVR ELEPDAHANQ VENAIAHGAE LVPGRSSPEF GAGRINALNT VSNL