Gene PCC8801_0894 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0894 
Symbol 
ID7101993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp939672 
End bp941021 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content44% 
IMG OID643473988 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_002371128 
Protein GI218245757 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1004] Predicted UDP-glucose 6-dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGTTT GTGTCATTGG AACTGGTTAC GTCGGCTTAG TGACAGGGGT TTGTTTAGCT 
CATATTGGCC ACCACGTTAT CTGTGTAGAC AATAACGAAG AAAAAGTCAA ATTAATGAAA
TCTGGTCAGT CCCCCATTTA TGAACCGGGT TTATCCGAAT TGATGCACTC GAGTGCCGAG
TCGGGACACT TGGAATTTAC GACAGATTTA GCAGCCGGGG TTAATCACGG GGAAATCCTC
TTTATTGCCG TAGGAACTCC CGCATTACCC AACGGAGAAA GCGATACTCG TTATGTAGAA
GCCGTAGCGC GGGGTATTGG AGCTAACTTA AATCAGGGTT ACAAAGTCAT TGTCAATAAA
TCAACGGTTC CCATCGGTTC TGGAGACTGG GTGAGGATGA TTGTCCTCGA TGGGTTAGCA
GAACGCCAAA ACGGACACGC GGATGCAGAG TTTGATGTGG TGAGTAACCC CGAATTTCTC
AGAGAAGGCT CGGCGGTTTA TGATACCTTT AACCCCGATC GCATTGTTTT AGGCAGCAAC
AGCGACAAAG CGATCGCCAT GATGCAAGAA CTCTACGCCC CCCTAGTGGA TCGTAAATTT
GGCGAGGATA TAACCTTACC TCCCGTTCCT GTGGTGGTAA CTGACCTAAA CTCAGCAGAA
ATGATTAAAT ATGCTGCTAA TGCCTTCTTA GCGACGAAGA TTAGTTTTAT TAATGAAGTC
GCTAATATCT GCGATCGCGT TGGGGCGGAT GTTACCCAAG TTGCTAAGGG GATCGGATTA
GACTCCCGTA TTGGCAATAA ATTCTTGCAA GCGGGGATCG GTTGGGGTGG ATCATGTTTT
CCCAAGGATG TTTTAGCCTT AATTCACACA GCAACCGACT ATGGTTATGA GACGGAATTG
TTAAACGCGG CGGTTCATGT TAATCAACGT CAACGACTCA TTGCCATTGA AAAATTACAA
CAAGAATTGA AGATTCTTAA GGGAAAAACC GTTGGATTAT TAGGGTTAAC CTTTAAACCT
GATACTGATG ACATGAGGGA TGCTCCTTCT TTAATTATTA TCGAACAACT CAACCGTTTA
GGAGCAAAAG TGAAGGCATA CGATCCGATT GTTTCACAAT CAGGGTTAAG TCATGGCTTA
TCGGGAGTGA TTATTGAAAC CAACCCCGAA ATGTTGGCCG ATAGTTGTGA TGCTTTGGTT
TTAGTGACAG ATTGGCAAGA ATTTCTGAAA CTCGATTATG GGAAAATGGC CAGTTTAATG
GCTAATCCTG TGATTATTGA TGGTCGTAAT TTCTTAGATC GCTCGAAACT AGAACAGGCC
GGTTTCCGTT ACTTAGGAAT TGGTCGGTAA
 
Protein sequence
MRVCVIGTGY VGLVTGVCLA HIGHHVICVD NNEEKVKLMK SGQSPIYEPG LSELMHSSAE 
SGHLEFTTDL AAGVNHGEIL FIAVGTPALP NGESDTRYVE AVARGIGANL NQGYKVIVNK
STVPIGSGDW VRMIVLDGLA ERQNGHADAE FDVVSNPEFL REGSAVYDTF NPDRIVLGSN
SDKAIAMMQE LYAPLVDRKF GEDITLPPVP VVVTDLNSAE MIKYAANAFL ATKISFINEV
ANICDRVGAD VTQVAKGIGL DSRIGNKFLQ AGIGWGGSCF PKDVLALIHT ATDYGYETEL
LNAAVHVNQR QRLIAIEKLQ QELKILKGKT VGLLGLTFKP DTDDMRDAPS LIIIEQLNRL
GAKVKAYDPI VSQSGLSHGL SGVIIETNPE MLADSCDALV LVTDWQEFLK LDYGKMASLM
ANPVIIDGRN FLDRSKLEQA GFRYLGIGR