Gene PCC8801_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0043 
Symbol 
ID7103709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp45598 
End bp46611 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content40% 
IMG OID643473159 
ProductUDP-glucose 4-epimerase 
Protein accessionYP_002370306 
Protein GI218244935 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID[TIGR01179] UDP-glucose-4-epimerase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGATA GACAGAGAGT AATCTTAGTT ACAGGGGGAG CAGGATATAT TGGATCTCAT 
GTGGTACGGG TTCTCTTAGA AGCTGGTTAT CAAGTGATTA TTCTTGATAA CTTAATCTAT
GGACATCGAG ATCTGGTGGA AACCATTTTA AAAGTAGAGT TAATTATAGG GGATATTGGC
GATCTCGCCC TACTAGATCA CCTATTTTCT AGCCATTCCA TTGAGGCAGT CATGCACTTT
GCGGGGTTTG GTTATGTGGG TGAATCCATT CAACATCCTC AAAAATACTA CCGTAATAAC
GTTGCCAATA CTCTAACCTT ATTAGAAGCG ATGAACCAAG CTTCTGTCAA TAAATTGGTC
TTTTCTTCAA CCTGTGCTAC CTATGGAATC GCTCAAACGT TTCCCATTAC CGAAAAACAC
CCACAGCAAC CAATTAATAC CTATGGCAAG AGTAAATTAA TGGTAGAACG GATGCTGAAG
GATTTTTCCC AAGCTTATCC CCTCAAATAT GTCTGTTTTC GCTATTTTAA TGCAGCCGGA
GCTCATCCAG ACGGATTGCT TGGAGAAGAT CATAACCCAG AATCCCATCT CATTCCCTTA
GTACTGTTAA CAGCATTGGG AAAACGGGAG TCCATCTCCA TTTTTGGGAC AGACTATCCC
ACCCCTGATG GGACTTGTAT TCGAGATTAT CTTCATGTGA TGGATATTGC CCAAGCCCAC
CTTTTAGGGT TAGAGTATTT ATTAGCCAAT GAAACCTCTA ATGTGTTTAA TTTAGGTAAT
GGTAATGGTT TTTCCATTCA ACAAGTGATT GATACGTCCA TGGACATAAC TCAAAGACCG
ATTTCAGTCA ACCTAGTTAA TCGCCGTCCT GGTGATCCCC CGATTTTAGT GAGTAGTAAT
GAAAAAGCAC GCCAAATTCT CGGATGGAAA CCCCAATATC CTAATTTAGA AGAAATTCTT
GCTCATGCTT GGCAATGGCA TCAAAAACGT CATCAAATTA CTGATTTAAC TTAA
 
Protein sequence
MSDRQRVILV TGGAGYIGSH VVRVLLEAGY QVIILDNLIY GHRDLVETIL KVELIIGDIG 
DLALLDHLFS SHSIEAVMHF AGFGYVGESI QHPQKYYRNN VANTLTLLEA MNQASVNKLV
FSSTCATYGI AQTFPITEKH PQQPINTYGK SKLMVERMLK DFSQAYPLKY VCFRYFNAAG
AHPDGLLGED HNPESHLIPL VLLTALGKRE SISIFGTDYP TPDGTCIRDY LHVMDIAQAH
LLGLEYLLAN ETSNVFNLGN GNGFSIQQVI DTSMDITQRP ISVNLVNRRP GDPPILVSSN
EKARQILGWK PQYPNLEEIL AHAWQWHQKR HQITDLT