Gene Nmag_4235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_4235 
Symbol 
ID8826863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013925 
Strand
Start bp9708 
End bp11087 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content63% 
IMG OID 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_003482305 
Protein GI289594298 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGACCTGC TCGAGCTGTG GGCGGTGGTC GACATGATCT CCGAATGGAC CGGCGACATC 
CACGAGGGTG ACGCCGAGGA GGTTCTTGCA GAACTCCCCG AGTCCTCGGT CCACTGCGTC
GTCACATCTC CCCCGTACTT CGGGCATCGC GATTACGGAG TCGACGGCCA GATCGGCCTC
GAGGACAGCC TCGACGAGTT CATCGAATCA CTCGTCGACG TCGCGAGCGA GATCCGGCGA
GTCCTGCGGG ACGATGGGTC TTGGTGGCTC AACCTCGGAG ATTCCTTCGC TGGATCCGGC
GGCGCAGGTG GTCAGTGGGG CCAGAACGAA CACGGCTCGG CGACTCGTCT CGCGGACGCT
GGTGATGCGT ACAACGGCCC GCTGAACACG AGCAACATCC GACGAAAGTC GAAGATGCTC
GTCCCTCACC GGGTGGCGAT TGCTCTCGAA AACGCCGGCT GGATCATCCG CGCCGACGCC
GTCTGGACGA AGCCCAACGG GATGCCGAGT TCCGCACACG ACCGTCTGAA CGAAAAGAAG
GAGTTCGTGT TCCACCTGGT TCCCGAGCCG CACTACTGGT TCAACCTCGA CGCCATCCGC
GAACCCCACT CTGAGGCGTC GCTGGAGCGG GCTGGTCGGC ACGACCAGGC GAAACGAGGC
TACCCGAGTA ACGACCACTC GCTCGAGCCA TCGCGGTTCT GCCACCCGAA CGGCAAGAAC
CCGGGCGACA TCTTCGAGAT CAACGCCGCG CAGTTCTCGG ACGCTCACTT CGCAGTTTTC
CCCGAGGAGC TGTGTAAGGA CCCCATCAAG TCATCGTGTC CTGAGAAGGT TTGCGCTGAG
TGTGGAACGC CGCACGAGCA ACTGACCGAG GAGATCGACC CGTGGAACGT CGAGAGCCCC
GATCGCGAGC AGCTTCGCCG GGCGATCGAG GTGTACAAAG CGTCCGATCT CACGGAAGAT
CACCTCGAGG CAGTTCGTGC GTACGGGTTC GCCGACGCTG CGGCGGGGAA GAACCAGAAC
CGCTCTGGCC TGAATGACGA GCGCGTTCAG CAGCTCGCCA GCGAGGCGAA GGACGTCCTC
GAGGGGTACT TCCGCGAGTT CACGACGACG TACGAGCGCC ACATCGGGTG GGAAGCCGCC
TGCGATTGTG AGACCGACGA GACGAACCCG GGAATCGTCC TGGACCCGTT CGCGGGCGCC
GGCACAACCT GCCTTGTAGC AAAACGATTC GGCCGACGGT TCATCGGCGT AGACCTAAAT
CCGGAGTTCG TTGCGATGGC CCAGCAGCGG ATCGGCCTCG ACGTCGACGA CCCTGATCTC
CTCCTCGACG AGGACGAAAC GAGCCTGAGA GAGTTCATCG AGGTCGGTGA TACCCCGTGA
 
Protein sequence
MDLLELWAVV DMISEWTGDI HEGDAEEVLA ELPESSVHCV VTSPPYFGHR DYGVDGQIGL 
EDSLDEFIES LVDVASEIRR VLRDDGSWWL NLGDSFAGSG GAGGQWGQNE HGSATRLADA
GDAYNGPLNT SNIRRKSKML VPHRVAIALE NAGWIIRADA VWTKPNGMPS SAHDRLNEKK
EFVFHLVPEP HYWFNLDAIR EPHSEASLER AGRHDQAKRG YPSNDHSLEP SRFCHPNGKN
PGDIFEINAA QFSDAHFAVF PEELCKDPIK SSCPEKVCAE CGTPHEQLTE EIDPWNVESP
DREQLRRAIE VYKASDLTED HLEAVRAYGF ADAAAGKNQN RSGLNDERVQ QLASEAKDVL
EGYFREFTTT YERHIGWEAA CDCETDETNP GIVLDPFAGA GTTCLVAKRF GRRFIGVDLN
PEFVAMAQQR IGLDVDDPDL LLDEDETSLR EFIEVGDTP