Gene Sare_3824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3824 
Symbol 
ID5703786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4356161 
End bp4357597 
Gene Length1437 bp 
Protein Length478 aa 
Translation table11 
GC content70% 
IMG OID641273246 
Product6-phosphogluconate dehydrogenase 
Protein accessionYP_001538608 
Protein GI159039355 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0362] 6-phosphogluconate dehydrogenase 
TIGRFAM ID[TIGR00873] 6-phosphogluconate dehydrogenase, decarboxylating 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.75231 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.03216 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGGC AGGCACAGAT CGGCGTGACC GGGCTGGCGG TGATGGGGCG CAACCTCGCC 
CGGAACCTGG CCCGCAACGG CCTCACGGTG GCAGTACACA ACCGCTCCCC GGAACGGACC
CGCGGGCTGG TCGCCGAGCA CGGCGACGAG GGACGGTTCG TGCCCACCGA GTCGATGGCG
GACTTCGTCG CCGCGCTGGA ACGACCCCGG GCAGTCATCA TGATGGTCAA GGCTGGTGGG
CCGACCGACG CCGTCATCGA CGAGTTGGTG CCGCTGCTCG ACGCCGGCGA CATCATCGTC
GACTGCGGCA ACGCCCATTT CGCCGACACC CGGCGGCGCG AGGAGGCGCT GCGCAGGCAC
GACCTGCACT TCGTCGGCAC CGGCGTCTCC GGCGGCGAGG AGGGTGCGCT GTGGGGGCCG
AGCATCATGC CCGGTGGATC GGCCGAGTCC TACCGGAAAC TCGGGCCGAT CTTCGAGCGG
ATTGCGGCGC AGGTGGACGG CGAGCCCTGC TGCCGCCACA TCGGTCCAGA CGGAGCCGGC
CACTTCGTCA AGATGGTCCA CAACGGCATC GAGTACGCCG ACATGCAGCT CATCGCCGAG
GCGTACGACC TTCTGCGGGC CGGCCTGGAC GCGACGCCGG CCGAACTGGC GGAGACCTTC
CGGCAGTGGA ACTCCGGCGA GCTGGAGTCG TTCCTCATCG AGATCACCGC CGACGTGCTC
GGACACACCG ACGCGAGCAC CGGACAGGCG TTCGTGGACG TCGTCCTCGA CCAGGCCGAG
CAGAAGGGTA CCGGGCGCTG GACCGTGCAG AGCGCCCTCG ACCTGGGCAT CCCGATCACC
GGCATCGCCG AGGCCACATT CGCGCGTTCG CTCTCCGGGC ACGCCGACCA ACGGGAGGCC
ACCCGCCGCG CGTTCGCCGG CACCGGACCG GCCTGGCAGG TAGCGGACCG GGACACCTTC
GTCGAGGACG TCCGGCGTGC GCTGCTGGCC AGCAAGATCG TCGCGTACGC GCAGGGCTTC
GACCACATCC GGGCTGGCAG CCAGGAGTAC GACTGGAACA TCGACCTGGG CGGCACCGCC
ACGATCTGGC GGGGAGGGTG CATCATCCGG GCACGCTTCC TCGACCGGAT CCGTCAGGCG
TACGACGATC ATCCCGACCT GCCCACCCTG CTGGTGGCAC CGTGGTTCGC CGACACCGTA
CGCGACGGGG TGCCGGGGTG GCGACGCGTG GTCGCCGAGG CTGCCCAGGC CGGTGTACCC
ACCCCCGCGT TCGCCTCCTC CCTGTCCTAC TTCGACGCAC TCCGCGCGAA TCGCCTCCCG
GCGGCCCTGA TCCAGGGTCT GCGGGACAAC TTCGGCGCGC ACACCTACCG CCGGGTCGAC
CGTGACGGCT CCTTCCACAC GATCTGGGCC GGCGACCACC ACGAGGTCGA AGCCTGA
 
Protein sequence
MSGQAQIGVT GLAVMGRNLA RNLARNGLTV AVHNRSPERT RGLVAEHGDE GRFVPTESMA 
DFVAALERPR AVIMMVKAGG PTDAVIDELV PLLDAGDIIV DCGNAHFADT RRREEALRRH
DLHFVGTGVS GGEEGALWGP SIMPGGSAES YRKLGPIFER IAAQVDGEPC CRHIGPDGAG
HFVKMVHNGI EYADMQLIAE AYDLLRAGLD ATPAELAETF RQWNSGELES FLIEITADVL
GHTDASTGQA FVDVVLDQAE QKGTGRWTVQ SALDLGIPIT GIAEATFARS LSGHADQREA
TRRAFAGTGP AWQVADRDTF VEDVRRALLA SKIVAYAQGF DHIRAGSQEY DWNIDLGGTA
TIWRGGCIIR ARFLDRIRQA YDDHPDLPTL LVAPWFADTV RDGVPGWRRV VAEAAQAGVP
TPAFASSLSY FDALRANRLP AALIQGLRDN FGAHTYRRVD RDGSFHTIWA GDHHEVEA