Gene Nmag_2046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2046 
Symbol 
ID8824889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2085690 
End bp2087339 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content63% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003480178 
Protein GI289581712 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.421423 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCCAA TCATGGTCGC GCCGGTCGTC GTCGGCCGCG AGCGTGCACG GTTTGTGGTC 
CGCCAGCTCC AGTCGGTCAA GTGGCGCTCG GTCGCGCTGT TTCTCTCCGC ACTGGTGCCG
TTCTGGCTCT TGATCAACGT CATGACACCA GAGACGCTGT ACGCCATTGG GCTCCTTGGT
GTCCTGTTAC TGGCCTATCT CGGCTGGTCG TACTACGTCG TCGAACACGC CCCACGGTTC
CGGGCCGTCT TCGGAACGCG AGGGCTGGCG GTTGCCGTCG GTAGCTACCT CGGACTCATC
GTCTGGCTCG GCTGGCTCAC CCCGTCGCCG CTCGCGCTCG TCCACCTCGG CGCCGTCGTG
TTGATCTTCG TCTACTACTG GTTCATCGCG TTCATCGCGC TCTTCCACGA CCAGATCGGC
CGTTCGAAGT ACGTCCCGAA CCCACCCTAC CCGCAGATTA CCGTCCTCAT TCCGGCCTAC
AACGAGGAGG GCTACGTCGG CCGAACGATC CAGTCCCTGC TCGACGCCAA CTACCCCGCA
GACGCACTCG AGATCATCGC AGTCGACGAC GGCAGCACCG ACGACACGCT CGCTGAGGCG
AGCGCCTTCG CCGCCGCGAG CGAGCAGGTC TCAGTCGTCA GCAAGGCAAA CGGCGGCAAG
TACTCCGCGC TGAACTACGG CCTCCTGTTT GCCGCCGGCG ACATCATCGT CACCGTCGAC
GCCGACAGTA TCGTCGACCG AGATGCCCTG AAACACATCG TCGCCCCGTT CGCCGCCGAC
GACGACATCG GCGCCGTCGC GAGTAACGTC ACCATCTGGA ACCGCGACTC GCTCATCACC
CGCTGCCAGC AACTCGAGTA CACCATCGGT GTGAACATCT ACCGGCGCGC GCTCGATTAC
TTCGGCATCG TGATGGTCGT CCCCGGTTGT CTGGGCGCGT ACCGCCGCGA GGTGCTCTCG
GAGGTGTTCG CCTACGATCC CGACACGCTC ACGGAGGATT TCGACGTGAC GATGAAGGTG
CTTCGGGCTG GCTACCGCGT CTCCGTGAGC GATGCGCGGG TCTACACCGA AGCACCGGCG
ACCTGGGGCG ATCTCTACCG CCAGCGCCTG CGCTGGTACC GCGGCAACTA CATGACGATT
ATCAAGCACT GGTCGGTCGT GACGGACTCG TCGTACGGCT ACTTGAACCG GATCGCGCTT
CCGTTCCGGC TGGTCGAGAT GTTCTTCCTG CCGTTTGCGA GTTTCGTCGT GCTTGCGTAC
ATCCTCTGGC TGATCGCTGC CGGCCACGTC CTCACCGTGT TCGCGGTGTT TGTCTTCTTC
ACGAGCATCG TCTTCCTGAT CGCGGCGCTC GGTATCCAGA TCGAAGGCGA GGACTGGCGG
CTGCTCGTCT ACGCACCGCT GCTCGTTGTC GGCTACAAGC AGTTCCACGA CGCGCTCAAC
GTCAAGTGTC TGTTCGACGT GCTCACGAGC CCAGAACTCG GCTGGACGCG CGCAGCGCGC
ATCGAGCAGG TTGTGGAGGC TCCGGACGCA GGTGCAATAC CAGAGCCAGC TGTAAGTGCA
TCGCCGTCTC CAGCCCCAAC TCCTGAGGCA GGACCACCAA CAGAAACGGA GACAGAGAGC
GAGACCGTTG CCGACACGGA CTCGAAGTAA
 
Protein sequence
MNPIMVAPVV VGRERARFVV RQLQSVKWRS VALFLSALVP FWLLINVMTP ETLYAIGLLG 
VLLLAYLGWS YYVVEHAPRF RAVFGTRGLA VAVGSYLGLI VWLGWLTPSP LALVHLGAVV
LIFVYYWFIA FIALFHDQIG RSKYVPNPPY PQITVLIPAY NEEGYVGRTI QSLLDANYPA
DALEIIAVDD GSTDDTLAEA SAFAAASEQV SVVSKANGGK YSALNYGLLF AAGDIIVTVD
ADSIVDRDAL KHIVAPFAAD DDIGAVASNV TIWNRDSLIT RCQQLEYTIG VNIYRRALDY
FGIVMVVPGC LGAYRREVLS EVFAYDPDTL TEDFDVTMKV LRAGYRVSVS DARVYTEAPA
TWGDLYRQRL RWYRGNYMTI IKHWSVVTDS SYGYLNRIAL PFRLVEMFFL PFASFVVLAY
ILWLIAAGHV LTVFAVFVFF TSIVFLIAAL GIQIEGEDWR LLVYAPLLVV GYKQFHDALN
VKCLFDVLTS PELGWTRAAR IEQVVEAPDA GAIPEPAVSA SPSPAPTPEA GPPTETETES
ETVADTDSK