Gene Nmag_3946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3946 
Symbol 
ID8826816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013923 
Strand
Start bp351847 
End bp354333 
Gene Length2487 bp 
Protein Length828 aa 
Translation table11 
GC content62% 
IMG OID 
Productconserved repeat domain protein 
Protein accessionYP_003482049 
Protein GI289583639 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.884871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAGA CTGCCAGATA CGTTCGACTC GCGATGGTTC TCGTTGCAGC CGTTGCGATC 
GCCATCGGTG GGACAATGCT GGTTGCAACC GAAACCGTTC TCGACGCCCT CTCCTACGAT
GTCGTCGTCG GACTGGAGGT GTTCGGTCAA CTGTTCTTTT ATCTGGTGTT GCTGGCCCTG
CTCGTCCAGA GTTACCGGAT CGTCCGGCGA CGGATCACTG CTGGCGAGAC CGAGGTGACA
CCACCGGACC GCGAAGCCAC GACTCGAGTA CCAACTCCTG GCGATAGACT CTCCCAGCAA
CTCGCGTCGG TGTCGGGTCG GTCGGATCCG ACGTACGAGG CGCTTCGGGA TCGATTGCGA
GCGCGTGCTC GAACAGCGCT TGTGAGCGCC GATCAGCGAA CGGAATCCGA CGCGGTCGAA
GCGATCGAAG CTGGCACCTG GACGGATGAC GACGTTGCAG CTGCGGTACT CGCCGACGAC
AGCGACTGGC CGTCACGGTC GGTCGAGGAT CGTCTCCTTG GGTTTCTCGG CTTTACCGAC
ATCCTCGCAC AGCGTGACCT GCGGCGCGCG ATGACTGCCG TTGCAGCGGT TGCCGGGGCA
CCCTCGGACG ACCGGCGGCC CACGGCTGTT TCTCCGGTTC TGTCTGGGGA GTGGGATTGG
GACTGGGAAC GGGAACAAGA ACAGGAACAG GAACAGGAAC GGGAACAGGA ACAGGAACGG
GAACGAAAAC GAGAACAAGA CTCAACTGCG TCAGCTGTCG ACCAAAAGCA AGCTTCGGAA
ACGGTTGCTG ATGACGATGG ATCGTACTCG AGTCCTGACG ATACTGACGA CCGGACAGAC
GATGATTCCG AGCAGGCAGA CGACGGTCCC GAGCAGGCAG ACGACACCCC TGAGCAAATG
GGGGATGATC CCGAGCGGCC TGCTGATGCA CCCACACGGC GTGAAACGGG TCGCTGGGAC
GGAATTAGCG TCGTCGCGCT CGTCGGTCTC GTACTGGGCG TGCTCTATCA GCAGCCAGCA
GTCATCGCGG CAGGGGCAGT TGGAATCGGG TTTGGGGCGT ACGCACACCT CGGTACGGAC
GACGACACAC AGCCGTCGAT TACGGCTGAG CGTAGCCTCA GTGAGACACA CCCTGAACCA
GGAGCCTCGG TCGACGTGAC GACGCTGGTG CGAAACGACG GGGATGGACG CCTCACTGAC
GTCCGAATTA TCGACGGCGT TCCGTCCGAC CTGGTCGTCG GGTCAGGCTC ACCGCGAGCA
GCGACGACGC TCGCGCCGGG AGAGGCAGTG TCAATACAGT ATACAGTCAC CGCGCGTCGC
GGTAGCCACC CATTCGACCC GGTGCAGGTT CTCGTCAGGA CTGCGAACGG GTCGATCGAG
ACAGAGCTGG CTGTTTCGGC GTCAACGCCG AGGGCAGCGA AATCGGCAGC GACAGAGAAC
GGAAGAACAT CCGTAGACAC AGCGATCACC TGCCTGCCGA CGATGCAACC GCTTTCGACG
GGTGTTGATC GGCTCCTTAC ACACCGCGGG TCCCGTCCCG CAGGCACACT GCAGACGGCC
GATCCAGGCC CTGGAGTCGA GTTTCACAGC GTCCGTGAGT ATCGTTCGGG CGATCCGATG
CAGCGGATCG ACTGGAACCG CTACGCACGA AGCGGTTCGC TGGCGACGGT TTCGTTCCGC
GACGAGCAGT CGGCGACCGT GATCGTGGCC GTTGACGCCC GAGCGAGTGC GTACCGCACG
CCCGATGATA CTGAGACGGT GCCGATGCAC GCCGTCGACC GGGCTGTCGA CGGTGGGGCT
GACGTTTTCG CCACGCTGCT CGATGCAGGC CACACCGTCG GCCTCGCATC CGTCGGCCCT
GATCCGCTCT GGCTGGCCCC CGGAACCGGT CGCGAGCACC GGATTCGTGG CCAGACTGCA
CTCGGCACGG ATCCGGCAGT CGGTCCACGA CCGCTTGCGA TGTCGACTGT CGGGACTAGT
ACTGACAGCA GTGTGGATAC TGTCGATACT GTTTCCGAAA CCGATGTCGA TGCTGATGAC
ACCGACGGCA GCGATGATGC CGATGACACC GACGGCAGCG ATGATGCCGA TGACAACACT
GACACCAACA CAGACGAGAC GACCACTGCT AACAGACCTA CTGCTAATAC CGCCAGCACC
AGCACCAACA CCACTAGCAA CACCAGCACC ACTTCCTCCA CCTACACCCT CACCCACCTC
AAGCACACCA CGTCAGCAGC CACTCAGATC GTGTTCTTCA CACCTCTCTG TGACGACGGG
GCTCTCGAGA TTGTCCGCTC ACTGTACTCG AGTGCGTTTG CCGTGACGGT AGTTACACCG
GATCCGACGA CGGTGGATGG GCCGTCGAGG GCAGTTGCAC GGGTCGAACG GAGAGCCAGA
ATACGACAGC TCCGATCGGA GGGAATTGCA GTGATCGACT GGGCGTGGGA CGATCCGCTT
GCGACTGCGA TGGAGGGTGA ACGATGA
 
Protein sequence
MSETARYVRL AMVLVAAVAI AIGGTMLVAT ETVLDALSYD VVVGLEVFGQ LFFYLVLLAL 
LVQSYRIVRR RITAGETEVT PPDREATTRV PTPGDRLSQQ LASVSGRSDP TYEALRDRLR
ARARTALVSA DQRTESDAVE AIEAGTWTDD DVAAAVLADD SDWPSRSVED RLLGFLGFTD
ILAQRDLRRA MTAVAAVAGA PSDDRRPTAV SPVLSGEWDW DWEREQEQEQ EQEREQEQER
ERKREQDSTA SAVDQKQASE TVADDDGSYS SPDDTDDRTD DDSEQADDGP EQADDTPEQM
GDDPERPADA PTRRETGRWD GISVVALVGL VLGVLYQQPA VIAAGAVGIG FGAYAHLGTD
DDTQPSITAE RSLSETHPEP GASVDVTTLV RNDGDGRLTD VRIIDGVPSD LVVGSGSPRA
ATTLAPGEAV SIQYTVTARR GSHPFDPVQV LVRTANGSIE TELAVSASTP RAAKSAATEN
GRTSVDTAIT CLPTMQPLST GVDRLLTHRG SRPAGTLQTA DPGPGVEFHS VREYRSGDPM
QRIDWNRYAR SGSLATVSFR DEQSATVIVA VDARASAYRT PDDTETVPMH AVDRAVDGGA
DVFATLLDAG HTVGLASVGP DPLWLAPGTG REHRIRGQTA LGTDPAVGPR PLAMSTVGTS
TDSSVDTVDT VSETDVDADD TDGSDDADDT DGSDDADDNT DTNTDETTTA NRPTANTAST
STNTTSNTST TSSTYTLTHL KHTTSAATQI VFFTPLCDDG ALEIVRSLYS SAFAVTVVTP
DPTTVDGPSR AVARVERRAR IRQLRSEGIA VIDWAWDDPL ATAMEGER