Gene Nmag_1942 details

Gene Information

Locus tag: Nmag_1942
Symbol:
ID: 8824783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natrialba magadii ATCC 43099
Kingdom: Archaea
Replicon accession: NC_013922
Strand:
Start bp: 1976302
End bp: 1977633
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 64%
IMG OID:
Product: protein of unknown function DUF214
Protein accession: YP_003480075
Protein GI: 289581609
COG category:
COG ID:
TIGRFAM ID:
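
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues encoded by a CDS that ends with one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1976302, 1977633)
print(length, protein_length_aa(length))  # 1332 443
```

Both values agree with the Gene Length and Protein Length fields of this record.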


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGCCAC TCGAGTTCGG TCGACTCGCC TGGCGCTCGA TCGGGAGCCA TCGGTTGCGC 
TCGGCGCTGA CGACCCTTGG GATTATCATC GGTATCGCGG CGGTGATCGC GTTCGTTACG
CTGGGTGCGA GCCTGCAGGC CGGCCTGCTC GGCGACATCA GCCCGGACGA TCAGCGGAAT
CTCTACGGGT GGGCCGCCGA TCCAGACACG GAAGGTGGCC CACTCGCCGG TGCGCAACCG
GTGTTCACGC AGGACGATCT CGAACAGGTG GACGAACTCG AGGACGTCGA CGCGGCCTAC
GGCTATATGC CGATTTCGAC GCAGGCGCTC GCCTACGACG GTGAGCTAAC GCCACAGAGC
GATGCGCTGA TCGCCGCGGG ACCACCGTAC ATCAGACCGG CGACGATCGA CGAGGGTCGG
CAGTTCGAGA TGGGCGAACG TGAGGCAGTG ATCAACCCGG CAGTGGCCGG CCAGTTCGAG
GAGAACGTCT CCGTCGGCGA CGAGTTGACC ATCGTCAGGC AGGGCGGCGA GCAGACGTCG
GTGACTGTCG TCGGGATCAC GGACAGTTCT GAGGGACTGA GTCCGTTCGA AGGGTTCGAG
CCGTCGCCAC GGGTGTACGT GCCGACGGAC CCCTACTACA CGGAGGAGGT AGACGGGATC
GGTGCCGGGT TTGGTGGTGA CGAAGCAGCT GAAGACGAAG CGGACGAGGC GGATCCAGCC
GACGGCGATG ACGGCGATGC AGCAACCGCC GAGGACGCCA GATTCCTCGC AATCGTCGTC
GAGGCACCGT CTGCCGACGA GGGGGATATC GACCAGGCTC GCGACAGCGC ACTCGCCGTA
CTCGAGAGCG ACGACTCCGA CGCGAGTGAG TTGCTCGGCG ACGACCTCGA GATCACCATG
CAGACGAGCA CCGAGTTGCT CCAGCAGCTA CAGGACATAC TCGACCTGCT GCAAAACTTC
ATCGTCGGCA TCGCGGCTAT CTCGCTCGTC GTTGGTTCGA TCGGCATCGC GAACATTATG
CTGGTCAGCG TCACCGAGCG GACCCGTGAG ATCGGGATTA TGAAGGCCGT TGGTGCGCAG
AACCGGGACG TGTTGGGCCT GTTCCTGACG GAAGCGGTGG TGCTGGGAAT CATCGGTGCC
ATCCTCGGCA CGGTACTCGG ACTCGCCGTT GGGTACGCCG GGGCGTGGTA CATCGATATT
CCGCTCGTCT ATCCCTACGA GTACGTCGCG CTCGCTGTCG CCGTGGGAAT CCTCGTCGGC
GTTCTCTCGG GGCTCTATCC CGCCTGGCGG GCGGCCCGAA CGGATCCGAT CGACGCGCTT
CGGTACGAGT GA
 
Protein sequence
MRPLEFGRLA WRSIGSHRLR SALTTLGIII GIAAVIAFVT LGASLQAGLL GDISPDDQRN 
LYGWAADPDT EGGPLAGAQP VFTQDDLEQV DELEDVDAAY GYMPISTQAL AYDGELTPQS
DALIAAGPPY IRPATIDEGR QFEMGEREAV INPAVAGQFE ENVSVGDELT IVRQGGEQTS
VTVVGITDSS EGLSPFEGFE PSPRVYVPTD PYYTEEVDGI GAGFGGDEAA EDEADEADPA
DGDDGDAATA EDARFLAIVV EAPSADEGDI DQARDSALAV LESDDSDASE LLGDDLEITM
QTSTELLQQL QDILDLLQNF IVGIAAISLV VGSIGIANIM LVSVTERTRE IGIMKAVGAQ
NRDVLGLFLT EAVVLGIIGA ILGTVLGLAV GYAGAWYIDI PLVYPYEYVA LAVAVGILVG
VLSGLYPAWR AARTDPIDAL RYE
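
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one possible way to do that in plain Python, without any sequence library. It assumes that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in their permitted start codons, which does not matter here because this gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 shares these codon assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, copied from the record above.
first_row = "ATGCGGCCAC TCGAGTTCGG TCGACTCGCC TGGCGCTCGA TCGGGAGCCA TCGGTTGCGC"
print(translate(first_row))  # MRPLEFGRLAWRSIGSHRLR
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared against the protein sequence and the GC content field listed in this record.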