Gene PCC8801_2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2029 
Symbol 
ID7104794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2101552 
End bp2102541 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content43% 
IMG OID643475088 
Producthopanoid-associated sugar epimerase 
Protein accessionYP_002372220 
Protein GI218246849 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR03466] hopanoid-associated sugar epimerase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATTA AAGCATTTAT CACAGGAGGA ACCGGATTTA TTGGGGCGAA TTTAGTTCGG 
TTATTATTAG ACCAAGGTTA TGAAGTCCGC GCATTAGTGC GTTCCCAAAG CCGTTTAGAT
AACCTAAAAG GGCTTGATAT TGAATTAGTA GAAGGAGATC TCAATGATGC CAATTTATCA
GAAAAAATCA GAGGAACTAA CGTCTTATTT CATGTAGCCG CCCACTATTC CCTTTATCAA
CGCGATCGCC ACCAACTTTA TCAAAGTAAT GTTTTAGGAA CCCGTTCCGT TTTAAAAGCA
GCCCAACAAG CCGGAATTGA ACGTACCATT TACACCAGTT CCGTCGCTGC TATTGGCGTT
GGAAACCCAT CAGAAATCGT CAACGAAACC CATCAAAGTC CCGTTGAAAA ATTAGTAGGA
CACTACAAAA AATCAAAATA TTGGGCTGAA CAAGAAGCCA AAAAAGCCGT TCAAAAAGGA
CAAGATATCG TCATCGTTAA CCCCAGTACC CCCATCGGTC CGTGGGACAT CAAACCCACT
CCAACAGGAG AGATTATCCT GCGGTTTTTA CGCCGTAAAA TGCCCGCCTA TGTAGATACT
GGATTAAATT TAATTGACGT GCGAGACGTA AGTTGGGGTC ATCTGTTAGC CTTAGAAAAG
GGTAAATCTG GAGAACGCTA TATTTTAGGT CATCAAAATC TCAGTCTCAA AGCCCTATTA
GACCAATTAT CCAGCCTCAC TGGATTAAGT GCACCCCAAA GAACTATCCC CTTGTGGCTA
CCCCTAACCA TGGCATGGAT TGACGAATCC CTTCTTACTC CTTTAGGAAA AACCCCGTCC
CTTCCCTTAG ATGGCGTTCG GATGTCTAAG TCACCGATGT ATTACGATGG ATCAAAAGCC
GTCAAAGAAT TAGGGTTGCC CCAATCACCT ATTAAAAAAG CCCTCCAAGA TGCAATTAGT
TGGTTTATCG ATCAAGGCTA TTCTTATTAA
 
Protein sequence
MAIKAFITGG TGFIGANLVR LLLDQGYEVR ALVRSQSRLD NLKGLDIELV EGDLNDANLS 
EKIRGTNVLF HVAAHYSLYQ RDRHQLYQSN VLGTRSVLKA AQQAGIERTI YTSSVAAIGV
GNPSEIVNET HQSPVEKLVG HYKKSKYWAE QEAKKAVQKG QDIVIVNPST PIGPWDIKPT
PTGEIILRFL RRKMPAYVDT GLNLIDVRDV SWGHLLALEK GKSGERYILG HQNLSLKALL
DQLSSLTGLS APQRTIPLWL PLTMAWIDES LLTPLGKTPS LPLDGVRMSK SPMYYDGSKA
VKELGLPQSP IKKALQDAIS WFIDQGYSY