Gene Acid345_2811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2811 
Symbol 
ID4071814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3334536 
End bp3335984 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content60% 
IMG OID637984829 
Product6-phosphogluconate dehydrogenase (decarboxylating) 
Protein accessionYP_591886 
Protein GI94969838 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0362] 6-phosphogluconate dehydrogenase 
TIGRFAM ID[TIGR00873] 6-phosphogluconate dehydrogenase, decarboxylating 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAA CTTCCGCGCC TGCCGTGGGC AGCGCGCAAT TCGGCGTCGT TGGATTGGGC 
GTCATGGGGC AGAACCTCGC CCTGAACGTG TCCGATCACG GCCAGGTCGT TTCGGTGTGG
AACCTCGAAG CCGATTGGGT CACGAAATTC CTCACCCAAC ACAGCGACCG CAAGATGACC
GGCAGCGGCG ATCTCAAAGA GTTCGTACAG TCCCTCGCGC GACCGCGCCG CATCCTGATG
ATGATCATGG CCGGCAATCC GGTGGACCAG ATGCTCGACA AACTGGCGCC ACTGCTCGAA
CCCGGCGACA TCGTGATCGA CGGCGGCAAC TCCTTCTTCA AAGACACCCA GCGCCGCGAA
GCCGCTTATC GCACGAAAAA TCTGAACTTC TTTGGCATGG GCGTTAGTGG CGGCGAAGAG
GGCGCGCGCT TCGGACCGAG CTTGATGCCG GGCGGCGACG AAGCTTCCTA CGAGCACCTG
AAGCCGACGC TCGAGGCCAT CGCGGCGAAG ACCGATTACG GTGCTTGCGT CACCTACGTA
GGTCCGAACG GCGCGGGCCA CTTCGTGAAG ATGGTCCACA ACGGCATTGA GTACGGCGAC
ATGCAGTTGA TCGCCGAGGC TTACGACCTG CTGCGCAAAG CGCTTGGTCT TGGCGCGAAG
GAAATTGCCG CAATCTTTGA AGAGTGGAAC AAGGGCAAAC TCGAGTCGTA CCTGATCGAG
ATCACCTCAC ACGTGTTGCA GGTCGATGAT CCGGAGACCG GCAAGCCGCT GGTCGACATG
ATCCTCGACA AGGCCGGACA AAAAGGCACC GGCAAGTGGA CGCAGCAGAT TGCGCTCGAC
CTCGCAGTAC CGATTCCGAC AATTGGCGCG GCGCTCGATG CCCGCGTGCT CTCTTCAATG
AAAGATGAGC GCGTGGCTGC CTCGAAGAAA CTCGGCGCTC CTTCCCGCCA GTATTCGGGT
GACAAGAAAG AGTTCATCAA CGCGGTGCAG GATGCCTTAT ACGCGTCGAA GGTCTGCTCC
TATGCGCAGG GCATGAGCCT GATTCGCGCT GGCTCGAAAG AGTGGAACTG GAACGTGAAT
CTGCGCGAGA TGGCACGAAT CTGGACCGGC GGCTGCATTA TCCGTGCACG GCTGCTTGCT
GACATCATGC ATGCCTTCGA CCGCGATGCG AATGTAGCGA ATCTTCTGGT CGAGTCCGAA
TTCACGTCGC GCGTGCTGGA GTCTGAGAAA AACTGGCGCA GCGTGGTGCA GACCGCCGCT
GGGCTCGGAA TTCCGACTCC AGCGTTCTCG TCGTCGCTCG CGTACTTCGA CAGTTATCGA
TCCATGCAAC TGCCGCAGAA CCTGACGCAG GCGCAACGCG ACTTCTTCGG CGCCCACACG
TATCAACGCG CGGACCGTCC GGACGCCGGT TTCGTCCACA CCGATTGGAT TAAGTTGGTC
AAGAAGTAG
 
Protein sequence
MSTTSAPAVG SAQFGVVGLG VMGQNLALNV SDHGQVVSVW NLEADWVTKF LTQHSDRKMT 
GSGDLKEFVQ SLARPRRILM MIMAGNPVDQ MLDKLAPLLE PGDIVIDGGN SFFKDTQRRE
AAYRTKNLNF FGMGVSGGEE GARFGPSLMP GGDEASYEHL KPTLEAIAAK TDYGACVTYV
GPNGAGHFVK MVHNGIEYGD MQLIAEAYDL LRKALGLGAK EIAAIFEEWN KGKLESYLIE
ITSHVLQVDD PETGKPLVDM ILDKAGQKGT GKWTQQIALD LAVPIPTIGA ALDARVLSSM
KDERVAASKK LGAPSRQYSG DKKEFINAVQ DALYASKVCS YAQGMSLIRA GSKEWNWNVN
LREMARIWTG GCIIRARLLA DIMHAFDRDA NVANLLVESE FTSRVLESEK NWRSVVQTAA
GLGIPTPAFS SSLAYFDSYR SMQLPQNLTQ AQRDFFGAHT YQRADRPDAG FVHTDWIKLV
KK