Gene Nmag_2554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmag_2554 
Symbol 
ID8825409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNatrialba magadii ATCC 43099 
KingdomArchaea 
Replicon accessionNC_013922 
Strand
Start bp2624277 
End bp2626799 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content67% 
IMG OID 
Productdihydropteroate synthase 
Protein accessionYP_003480676 
Protein GI289582210 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.324611 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGTATC ACGAGGCGGC GGACGTTCTC TTCGGCTTGC GGCGCTTTCG GCCGAAGCCG 
GGGACGGCGT CGACGGCGCA GCTACTTGCA ACCCTCGACG ATCCACACGA GGACGTCGCG
TTCGTGCAGG TCGCCGGCTC GAACGGGAAG GGCAGTACGG CGCGGATGGT CGAACAGACG
CTTCGCGAGA CGGGGCTCTC GGTCGGACTC TACACCTCTC CCCACCTGGA GGACCTCCGC
GAGCGTGTCC AGGTCGACGG CCGGAAGATT CCCCGCTCAG CTGTCTGTGC GTACGTCGAA
TCGGTCCACG ACTACCTGAC CGAGCAGGCC GCCGACGGGA ACTCGCCGAC GTTCTTCGAG
GCGATGACCG CGATGGCGAT CTGGCAGTTC GGCCGCGAAA ACGTCGACGT GGCCGTCCTC
GAGGTCGGTA TCGGCGGCCG GTACGACGCG ACCAGCGTCG TCGATCCCGT TGCGAGCGCG
GTTACGAGCG TGACGCTCGA ACATACCGGC ATTCTGGGCG AGACGGAAGC CGAAATCGCC
CGCGACAAGG CACACGTCGC ACCCGCGGAC GCACCGCTCG TTACAGGCGT CACCGGGCCG
GTACTCGAGT CCATTCGGGA GGTTGCAGGC GAGGTCGTTA CAGTTGGAAC GAGGGGTGGG
GGGCAGGGAG CGGACGGTGA CGACGAGACG ACTGCGCCAG ACGTCCGCGT CGAGTACGGC
GGGCGCGCAA ACCACACCGA AGCCGCGGTC TCGATCGACG CCGACGACTG GTCCGTCTCG
ACGCAGATTC CGCTGCCGGG CACTCATCAG GCCGAGAACG CCGGCATCGC CGCCGCACTC
GCACGCCAGA TCGCCGACGT GTCGACCGAC GAACTCGCAC GCGGGCTGCG AAACGCCCAC
TGGCCGGGAC GCTTCGAGGT GCTGGATACC GAGCCGCTCG TCGTCCTCGA CGGCGCGCAC
AACCCCGGTG CCTGCGAACA ACTCGCTGCC ACGCTCGGAA CGTACGAGTT CGACGACCTC
CACCTCGTCT TCGGTGCGAT GCACGACAAG GACCACCGCG AGATGGTCGC CGCGCTGCCG
ACCCCCGACG CTGTAATTAC GGCAGAGCCG GGACTCGACC GCGCCGAGGA CCGCGACGTG
CTCGCGACTG TCTTCGATGA CGCCGGCACA GCCCAGGTAA AGACGACGCG GAGCGTCCCC
GACGCGCTCT CGCAGGCGCT CGCCGACGCC GGTCCCGACG ACTGCGTCCT CGTCACGGGC
TCGCTGTTCG CCGTCGCCGA GGCCCGCTCG CGCTGGACGA CGACCGGGAT CACCAAGCGA
ATCCGCGACC GGTCGGACGC ACGGAATGCA CTCGAGTCCG CACAGGTTTC AGACGCCGAC
GTCGACCGCT TCGACGGCGA AGCGGTCCAC CGCGTAGTCA AGACGAACCT GCAGTCGTCG
CAGGCGAACC GACTGCGAAC GGAACTGGTG CGTCTCGGCG GCGAGTGTGT CGTGTCGGGA
GCCGAGGACG GACACGAGGA GCGCGTCGAT GCGGTGTTGA TGGGAACGCT CGCGCAGTTC
GAGCAACTGG TATCGACCCT CGATCGCGAG GACGGACTCG AGACGGTTGC GCGTGAGATT
GGCGAGACGC TCGGTCTCGA GTCGGCAGCC ACTGCGGCTC CGGAGTCAGA CACCGCTACA
GCCACACCCA CCAGCGCCGA GACACCGCCC TGGGCCGACC GCACCGCCGT CATGGGTATC
CTTAACGTCA CCCCCGACAG CTTCCACGAC GGCGGCGAGT TCGACGCACT CGAGGACGCC
GTCGAGCGCG CAGAGGCGAT GGTCGCGGCC GACGTGGACG TGATCGACAT CGGCGGGGAG
TCAACCCGCC CCGGTGCAGA GCCGGTGCCG GTCGAGGAAG AAATCGAGCG CGTCGTCCCC
GTTATCGAGC GGATCGCGGA CCTCGATGTT GCTATCTCGA TCGATACGCG AAAGGCGGTC
GTCGCCGAGG CCGCACTCGA GGCCGGCGCG GACATCATCA ACGATGTCTC CGGGCTCGAG
GATCCCGAAA TGCGCTTCGT CGCGGCCGAG CACGACGCGG GGCTGATCGT GATGCACAGT
ATCGATTCGA TCGTCGACCC GGACCGCGAG GTGACCTACG ACGACGTCGT CGAGGACGTG
ATCGATCAGC TGTCAGATCG GCTGTTGCTT GCCGAGAAGG CAGGCGTCGA CCGGGAGCAG
ATCGTCGTCG ATCCCGGAAT TGGCTTCGGG AAGTCGGCGG CGGAGTCGTT CGAGATTCTG
GACCGACTCG AGGAGTTCTG CGCGCTCGGC TATCCGGTGC TCGTCGGTCA CTCGCACAAG
TCGATGTTCG CCCACGTCGG CCAGGGGCCG GACGAGCGGC GTGACGCGAC GGTTGCGGCG
AGTGCGCTCG CGGCGGATCG GGGTGCGGAT CTCATCCGGG TCCACGACGT GCCGGAGAAC
GTCGCGGCGG TCCGGACGGC GCTTGCAGCG CGGGATCCGG AGCGGTTCGA CTGGGAATCA
TAG
 
Protein sequence
MEYHEAADVL FGLRRFRPKP GTASTAQLLA TLDDPHEDVA FVQVAGSNGK GSTARMVEQT 
LRETGLSVGL YTSPHLEDLR ERVQVDGRKI PRSAVCAYVE SVHDYLTEQA ADGNSPTFFE
AMTAMAIWQF GRENVDVAVL EVGIGGRYDA TSVVDPVASA VTSVTLEHTG ILGETEAEIA
RDKAHVAPAD APLVTGVTGP VLESIREVAG EVVTVGTRGG GQGADGDDET TAPDVRVEYG
GRANHTEAAV SIDADDWSVS TQIPLPGTHQ AENAGIAAAL ARQIADVSTD ELARGLRNAH
WPGRFEVLDT EPLVVLDGAH NPGACEQLAA TLGTYEFDDL HLVFGAMHDK DHREMVAALP
TPDAVITAEP GLDRAEDRDV LATVFDDAGT AQVKTTRSVP DALSQALADA GPDDCVLVTG
SLFAVAEARS RWTTTGITKR IRDRSDARNA LESAQVSDAD VDRFDGEAVH RVVKTNLQSS
QANRLRTELV RLGGECVVSG AEDGHEERVD AVLMGTLAQF EQLVSTLDRE DGLETVAREI
GETLGLESAA TAAPESDTAT ATPTSAETPP WADRTAVMGI LNVTPDSFHD GGEFDALEDA
VERAEAMVAA DVDVIDIGGE STRPGAEPVP VEEEIERVVP VIERIADLDV AISIDTRKAV
VAEAALEAGA DIINDVSGLE DPEMRFVAAE HDAGLIVMHS IDSIVDPDRE VTYDDVVEDV
IDQLSDRLLL AEKAGVDREQ IVVDPGIGFG KSAAESFEIL DRLEEFCALG YPVLVGHSHK
SMFAHVGQGP DERRDATVAA SALAADRGAD LIRVHDVPEN VAAVRTALAA RDPERFDWES