Gene Nmag_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_3043 
Symbol 
ID8825903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp3141280 
End bp3142905 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content65% 
IMG OID 
Producttype I phosphodiesterase/nucleotide pyrophosphatase 
Protein accessionYP_003481157 
Protein GI289582691 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGATT CGACAGACGA ACCTGGGCTT CGATTGCTGG TCGTCGGTCT CGACGCCGGC 
TGTCGGCCAA TTCTCCATCG ACTGTTCGAG AATGGCGAGG TTCCGACGCT CCAGCAACTG
TTCGACAACA GCGCTAGCGG GCCACTCGAG TCCCAGATTC CGCCGTGGAC GGCCAGCGCC
TGGCCGTCGA TGTACACCGG GAAGAACCCC GGCAAACACG GCGTCTTCGA CTTCCTTTCT
TTCGATGGCT ACGACTGGGA CGTGGTGAAC TCGACGCACG TGCGCGAACG CCCTGTCTGG
GAACTGCTCT CCGACCACGG CATCTCGAGT GTCGTCGTCA ACGTGCCGGT CACCCACCCC
GCACGGGAGT TCGACGGCGC ACTGATTCCG GGGATGACCG CACCCGAAGA CCCCGACTGC
CATCCGGAAG GGATACTCGA GGACGTGAAG CTCGCCTGCG GTGACTACCG GATCTACCCC
CAGAGCGGTC CCGAACCGGA CCGATCGATC GAGGGCTACG AACGGACCCT CGAACTCCGC
GGGAAAGCGT TTCGCTACCT CTGTCGGCGC ATCGAGCCCG GCTTTGGCTT TCTCCAGTTC
CAGCTCACTG ACTCCGTCTT CCACGAACGT CCGGGCGACA AGAAGGCCAT CGAGGCAGTC
TACCGCGAAG TCGACCGCCA GCTCGAAGCA ACACTCGAGG AGACCGACCC CGACAACATC
CTGGTCGTTA GCGACCACGG GATGGGGCCC GTCTCCGGCC CCGAGTTCCG GGTCAACGAA
TTCCTGCGCG ACCAGGGCTA CCTGGACGGC CAGAAGGGCG GCGAGGGAAT GCCAAACTGG
TCGACCGCCT GGGAGAACGA TCTGCTGGAA GGCGAGGACG CGAGCGAGCA CGAGGCGGGC
GCGGTAGAAC GGGCGATGAA CGCCGCGGCC AGGGTCGGGA TCACGACCCA GCGCGTTGCG
AGTGCACTCG AGTACGTCGG GCTCAAGGAG CCGATCGGGC GACGGGTACC CAACGATATG
ATCCGTGCGG CGAGCGAGCA GGTCGACTTC CCGAATTCGA TGGTCTACAT GCGCTCGAAG
AGTGAGCTTG GGTTACGGAT CAATCTCGAG GGGCGCGAGC CGAACGGGCA GGTGCCAGCT
GATGACTACG AGCGGGTGCG AGACGAGGTG ATCGACGCGC TGGCGGACGT GCGAACGCCG
GACGGTGAGC CGGTGTTCGA CGCCGTCGCG CCGCGGGAAG CCTACTTCGA CGGCCCGTAC
GTCGAGCACG CGCCGGATAT CGTCACCGTC CCGGCCGACT TCGATACAGC CGTCGCCGCC
GACCTCGGAC AGGCGCAGTT CGGCGAGCCG ATGGAGGCCT GGAACCACAA GCAAACCGGC
GTGGTGGCGA CGTCAGGCGA GGCGTTCGAC GAGCGTGCCG GCGTCGCGGG TGCGACGATC
TTCGATATCG CACCGACGAT CTGCTCGCTG TTCGACGTGC CCATCGATGC CGAGATGGAC
GGGACCACAC TTCCGGTGAT CGAGGAGAGT TCGCGCACGG AGTACCCGGC CTACGAACCC
GACCCGATCA CCGTGACCGA CGACCGGGCG GTCGAAGACC GTCTCTCTGA TCTGGGGTAC
CTATGA
 
Protein sequence
MDDSTDEPGL RLLVVGLDAG CRPILHRLFE NGEVPTLQQL FDNSASGPLE SQIPPWTASA 
WPSMYTGKNP GKHGVFDFLS FDGYDWDVVN STHVRERPVW ELLSDHGISS VVVNVPVTHP
AREFDGALIP GMTAPEDPDC HPEGILEDVK LACGDYRIYP QSGPEPDRSI EGYERTLELR
GKAFRYLCRR IEPGFGFLQF QLTDSVFHER PGDKKAIEAV YREVDRQLEA TLEETDPDNI
LVVSDHGMGP VSGPEFRVNE FLRDQGYLDG QKGGEGMPNW STAWENDLLE GEDASEHEAG
AVERAMNAAA RVGITTQRVA SALEYVGLKE PIGRRVPNDM IRAASEQVDF PNSMVYMRSK
SELGLRINLE GREPNGQVPA DDYERVRDEV IDALADVRTP DGEPVFDAVA PREAYFDGPY
VEHAPDIVTV PADFDTAVAA DLGQAQFGEP MEAWNHKQTG VVATSGEAFD ERAGVAGATI
FDIAPTICSL FDVPIDAEMD GTTLPVIEES SRTEYPAYEP DPITVTDDRA VEDRLSDLGY
L