Gene Nmag_1446 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1446 
Symbol 
ID8824279 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1475572 
End bp1477404 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content67% 
IMG OID 
Productpeptidase M3A and M3B thimet/oligopeptidase F 
Protein accessionYP_003479587 
Protein GI289581121 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAGACT ATCCAACACG CAACGAACTG GACGACCAGT ACACCTGGGA CCTGACCCGC 
CTCTACGAGA CCCCCGCCGA CTGGGCACGC GCCGCAGACG AACTCGAGTC CCAGCTCGAA
ACGCTACACG ACCACCACCA GCCGACCGAC TCACCCGCCG CGCTGGCAAC CGCACTCGAG
TCCATCGAGT CTGCGCTGGT TACCAAGGGT CGGCTGACCC TCTACGCGCA GTTGCACCGA
AACGAGCATC CGACGGACGA GAGGCGGCGC GACCGCTACA GTCGCGGGCA GCGACTCGCC
GCCAGGGTCG ACGAGGCCGT TCGCGGACTA CAGCGGCGGA TTCAGCGCGA TGCAGCGAGC
GTGCGTGCCT TCCGCGAGGA GGGGGACAAC GAACTGGACG GCTGGCACGC CTACCTTGAC
GACCTCCTCG CGCAAGCGCC CCACACCCGC GACGCCGACA CCGAATCGGT CGTCAGCGCG
TTCGCACCCG TCATCGAAGC CCAGACGGAC ACGATCGCCG CGATCAAGAC TGCCGATTTC
GACCCGCCAA CCGTCGAGGG ATCGGACGGC AATCCGGTCG CCATCGACCA CGGAACGTAT
CGGGAGGCAC TCGAGAACCC CGACCGATCG GTCCGCCGAC GAGCACACAC GGGGTACATC
GACGCGCTGG CCGAGCACGA CCACGCGCTC GCGGCGGCGC TCACCGAGAA GGTGCGTGCT
CACGCGGCCC TTGCAGCGGT TCGAAACTAC GAGTCAGTCC GCGAACTCGC GCTCGCAGAA
CCGAGCTATC CCGACACCGG GATGCACACC TCGTTTGCCG AAGCGACCCA CGACGCGGTT
CTCGAGAACG TCCGCGATCA CCTCGAGGCG TATCACGGGC TACTCGAGAG CCGCCGCGAA
CTGCTCGACG TCGAAACGCT GCGTCGGTGG GACGAGCGGG TACCCGTGGT TTCCGACGGC
GCGGACGCAC CCGAAATCAC GTTCGAAGAA CTGTGTGAGC ACCTCCTGGC TGCGGTCGAA
CCGCTCGGTG CGGACTATCG GAACCGACTC GAGACGCTCC TCACGGAGCG GCGAATCGAC
GTCTACCCGA CAGCGCGAAA GCGGACGGAC ATTCCAGCGT ACTGTCCCTC GAGCCCGGAC
GCCGGTCCAT TCGTGCTCGC GAACTTCCGC GAAGACGTGC GGACGGCGTT CTACCTCGCG
CACGAACTCG GCCACGCGAT GCACATCGAG TGTATGCGGG CGTCTCAGCC GCCGCGGTAC
GTGAACAGCC CGCGGCCGAC CAGCGAGGTG CCGAGTCTGG TCCACGAACT GCTGTTAGCC
GACCATCTCC GCAAGAAGGG TGGCCCGGAA CTCGCCCCGT TCGTCCGCGA GCGCCGAGCG
CAGTTCCTCG CCGGCAACGT GTACGGCGCT GGCGAGAGCG CGACGTTCCT CCACAAGGTG
TACCGAACCG CCGAGTCGGG GACCGACCTG ACGCCGACTG GGCTGTCGGA GTTGTACGCG
GACATTGCCG CCGAATTTCG TGCGCCGGTT GCGCCGCCGG AGGGTGACAC GGCTGCAGCG
ACGCGCGGCG CACCCTGGCG ACAGCAGTCG TACATCCGTG ATCCGTACCA TAACTACCAG
TACGTCACCG GCACCGTCGC GGCCGTCTCG GTTGTCAGGC GGCTTCAGTC GGGTGCGCTT
TCCGCCGGAG AGTATCTCGA GTTCCTCCAG AACACGGGGC GGCGCGAGTC GAGTGTGTCG
TTCGACGCGC TCGGCGTCGA CGTGACGGCG GCCGAGCCGT ACGAGCGCCT GGCGAGTGCA
CTCGAGTCGA TTCGGGCTGC TCGGCTGGGC TGA
 
Protein sequence
MTDYPTRNEL DDQYTWDLTR LYETPADWAR AADELESQLE TLHDHHQPTD SPAALATALE 
SIESALVTKG RLTLYAQLHR NEHPTDERRR DRYSRGQRLA ARVDEAVRGL QRRIQRDAAS
VRAFREEGDN ELDGWHAYLD DLLAQAPHTR DADTESVVSA FAPVIEAQTD TIAAIKTADF
DPPTVEGSDG NPVAIDHGTY REALENPDRS VRRRAHTGYI DALAEHDHAL AAALTEKVRA
HAALAAVRNY ESVRELALAE PSYPDTGMHT SFAEATHDAV LENVRDHLEA YHGLLESRRE
LLDVETLRRW DERVPVVSDG ADAPEITFEE LCEHLLAAVE PLGADYRNRL ETLLTERRID
VYPTARKRTD IPAYCPSSPD AGPFVLANFR EDVRTAFYLA HELGHAMHIE CMRASQPPRY
VNSPRPTSEV PSLVHELLLA DHLRKKGGPE LAPFVRERRA QFLAGNVYGA GESATFLHKV
YRTAESGTDL TPTGLSELYA DIAAEFRAPV APPEGDTAAA TRGAPWRQQS YIRDPYHNYQ
YVTGTVAAVS VVRRLQSGAL SAGEYLEFLQ NTGRRESSVS FDALGVDVTA AEPYERLASA
LESIRAARLG