Gene Nmag_0076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_0076 
Symbol 
ID8822895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp92442 
End bp93692 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content66% 
IMG OID 
Productprotein of unknown function DUF418 
Protein accessionYP_003478237 
Protein GI289579771 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.477208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTGAAC GCGCCGAGGA CGGACCGACA CCGCCCTCGG AACGGATTGT CGCCCTCGAT 
GCGCTGCGCG GGGTTGCCTT GCTTGGGATT TTGCTCGTCA ACATCTGGCT GTTTGCGATG
CCAGAGGCGA CGCTGTCGAA TCCGACGGTG CACGGCGAGT TCACCGGTTG GAACTACTGG
GCCTGGTTCG GCACGCACGT TTTCGCCCAG CAGAAGTTCA TCACGCTGTT CACCTTGCTG
TTCGGAGCCG GCATCAGCCT GTTCACGCAG AGTGTCCGGT CGAAATCGAC GGAGCACGCT
GCGACCGTGC TGTCGGCGCG TCGCTTCGGC TGGCTCGTGC TGTTCGGACT CGCCCACGCC
TACTTGCTGT GGTACGGCGA CATCCTCGTG GCGTACGGCG CGTGCGCGTT CGGTGTCCTC
GTCCTCCGAG ACCTTCCGGC CCGAACCCTG TCAGCAATCG GAATCGGGCT CATCGCAGTG
CCGTCGCTTC TCGAGATACT GGCGGCGCTG ACGGCAGACC CAAGCACGGT CGCGGATACG
TGGCGGCCGG CCGAGTCAGT TCTCCGCGCT GAAGTCGAGA CCTACAGAAG CGGCTGGATT
GAGCAGTTCG ACCACCGTGC GTCGACGTCG TTCCGCCGCC AAACCACTGA CTTTGTCGGC
TACACCGCCT GGCGCGTCGG TGGCGTGATG CTCCTCGGGA TGGCGCTGTT CAAGTGGGGC
GTGCTGACGA ACGAGCGGTC CGCACGATTC TACGCCCGGC TGGCAGCCGT CGGCGCGGTC
AGTGGCCTTG TCGTGATCCT GGCTGGCGTC GCCTACATCC ACGCCCACGA CTGGGGCGTC
GAGGGCGCGC TCCTCTGGCG GCAGTTCAAC TACTGGGGCA GCCTCCCGCT CGCTGCTGCC
TACCTCGCAC TCGTGATGTG GTTCTGTCAG TGGCGACCCG ACGGCCTCGC AACCCGGTCG
TTCGCCGCCG TCGGCCGCAC CGCGTTCAGT AACTACCTCC TGCAGACGGT GCTCGCCACG
TCCATCTTCT ACGGCCACGG CCTCGGCCTG TTCGGCGCGG TCACGCGGGT CGAACTGCTC
GCCATTGTCG TCGCCATCTG GGCGGTGCAG GTGCCGCTGT CGGTGCTGTG GCTGCGGTAC
TTCCGGTACG GGCCGATGGA GTGGCTCTGG CGGGCGCTGA CGTACAAGTC GAAACCGCCG
CTTCGGGAGT CCCGTGCTGG ATCGGACACA GCGGAGCAAA GCGACGGGTA G
 
Protein sequence
MSERAEDGPT PPSERIVALD ALRGVALLGI LLVNIWLFAM PEATLSNPTV HGEFTGWNYW 
AWFGTHVFAQ QKFITLFTLL FGAGISLFTQ SVRSKSTEHA ATVLSARRFG WLVLFGLAHA
YLLWYGDILV AYGACAFGVL VLRDLPARTL SAIGIGLIAV PSLLEILAAL TADPSTVADT
WRPAESVLRA EVETYRSGWI EQFDHRASTS FRRQTTDFVG YTAWRVGGVM LLGMALFKWG
VLTNERSARF YARLAAVGAV SGLVVILAGV AYIHAHDWGV EGALLWRQFN YWGSLPLAAA
YLALVMWFCQ WRPDGLATRS FAAVGRTAFS NYLLQTVLAT SIFYGHGLGL FGAVTRVELL
AIVVAIWAVQ VPLSVLWLRY FRYGPMEWLW RALTYKSKPP LRESRAGSDT AEQSDG