Gene Nmag_1620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_1620 
Symbol 
ID8824455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp1641343 
End bp1642587 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content52% 
IMG OID 
ProductDNA methylase N-4/N-6 domain protein 
Protein accessionYP_003479758 
Protein GI289581292 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACAGG ATTATAGCGC GTATGAAGAC GCGCTATCAT CGGCGGAGAT CATCGGCACA 
GAGACGCTCC CACATTCTTG GCGGGAGGCA CCTCGTGCGT GGGGTAACGA ATTACACAAG
CTTGCTCCCT ACGTTGGTGG GTTTCCACCC GCACTTGCGA ACTATCTGCT TCAACAATAT
ACTGACCCCG AGATGACGGT CCTTGATCCG TTTTCGGGCG GCGGCACGAC TGCTCTTGAG
GCTTCGCTTC TTGATCGCGA TGTTCTGGCC AGTGATGTGT TCAGCTATGC GTGTACGCTC
ACTGGGGCAA AGACGCATCC TTTGACTGAA GACGAATTCG ATGCGGCACT GAAACGGGTC
AAGCGAGACG CAGATAATGT CAACCCAGAT ATGTTACCAC CCGTGGATGA GGATGCAGCA
ATCTTTTTTC ACGAGGACAC ACTTCAAGAA CTGCGGCAAT ACCGAGCCGT TCTCAAAGAG
GACACCAGTC GTGATGGCAG GTTTCTGAAG GCGGTGATAT GTGGCATTCT TCACGGCCCA
TCAGAGCTAT TTCTTTCAAT TCAGACTCGC GATACGTTCT CTGGGAGTGC GAATTATGTT
AAGGATTACA AGGAAAAACA CGATCTAGAG GAAGTATATA AGCCAGTTGA TGAGAAGGTC
CGTGACAAGT TTAATCGACT CGTTGGTGCG CAATACCCTT CCGGGAGCAC CCGAGTTGAG
CAAGCTGATG CGACGTCGCT GCCATTTGAG GATGATACAG CTGACTTTGT ATTGACGTCA
CCGCCGTACA TGCATATGTT AGACTATTCA TGGAATAATT GGCTTCGGCT GTGGTGGCTT
GACGAGGATC GGTCGGCGGA GCAAGACTCG CTCAATCAGA CATCCAAGGT AGAGTTGTTT
CGGTCGTTCA TGACCGATGT AATCGCGGAA CTGGACCGTG TGTTAAAATC CGATGCGCGA
GCTATCATCG TTATTGGTGA CGTCCGTAAA CATCGGCAGG GTGGCGCGAA GGTAGTCTAT
CCAGCTCGGA TGATTGCTGC GGAGGCGAGT GAGTTTGGTT TCGAGGTTGA GCGGGTTATT
GAGGACGATT ACAACGTGGA CAAGCGCTAC TACACGCAAT TGAACAATCT CCGGTGGGAT
GAGGAGGAAG AAGATGACGG GCAGGAGCTA ATCGATCGAA TTCTCGTGTT ACGGAAGGGA
GATCCCGGAC CACAGGTAGA GGTAACGCCG GAGTGGAGCG ACTGA
 
Protein sequence
MSQDYSAYED ALSSAEIIGT ETLPHSWREA PRAWGNELHK LAPYVGGFPP ALANYLLQQY 
TDPEMTVLDP FSGGGTTALE ASLLDRDVLA SDVFSYACTL TGAKTHPLTE DEFDAALKRV
KRDADNVNPD MLPPVDEDAA IFFHEDTLQE LRQYRAVLKE DTSRDGRFLK AVICGILHGP
SELFLSIQTR DTFSGSANYV KDYKEKHDLE EVYKPVDEKV RDKFNRLVGA QYPSGSTRVE
QADATSLPFE DDTADFVLTS PPYMHMLDYS WNNWLRLWWL DEDRSAEQDS LNQTSKVELF
RSFMTDVIAE LDRVLKSDAR AIIVIGDVRK HRQGGAKVVY PARMIAAEAS EFGFEVERVI
EDDYNVDKRY YTQLNNLRWD EEEEDDGQEL IDRILVLRKG DPGPQVEVTP EWSD