Gene PCC8801_3807 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3807 
Symbol 
ID7102106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3991997 
End bp3993367 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content47% 
IMG OID643476812 
ProductOpcA protein 
Protein accessionYP_002373913 
Protein GI218248542 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3429] Glucose-6-P dehydrogenase subunit 
TIGRFAM ID[TIGR00534] opcA protein 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACTC AAACCCCCCC TATCGTTTCT TTGCAAGACC CCAAGGATGT CTCTATTGAT 
GTCATTGAGG CAGAATTACG CGCTATATGG CAAAATTACA GTAACAATGA AGATGGTATT
GCGGCCACCC GTGCTACAAC CTTTACCTTT ATTGTCTACG AACCCGAACC GACCCAATAT
CTCTTAGCCG TGTTAGGCTT CTATACCGGT CCCGTCGATG GTATTGCTGG ACCGCGTACC
ACGGCTGCCA TTAAATCAGC CCAAAAAGCC TATAATCTCG AAATTACGGG CAAATCTAAC
GAAGCTTTTA TTACAACCTT ACAGGCCGAA TTTGAGAAAG CCAAAGCAGA AGGTCAATTA
ACCGATGCTC AACAACTGCT GGCAAAAGCC TATTCTCCTG ACCTAGAAGG GACTGGAGTG
GCTGATAGTA TCGCAGCTTC TAACCCTTGC CGTATTATTA CCCTGTGTCC GACTATCGGG
GAAGATGAAG GGGTTAAGGC GCAGGTATCG GCCTATTGTC CCATTAATAA GCGCAGTCAT
AATGCTTTGA TTTGTTGCGA ATACATTACC CTACGGGGAA CAGCCGCCGC GTTAGAACGA
ATTGGTGGCA TTATCTCCGA ATTGGTGATT ACTGGACTAC CGACTTTTGT TTGGTGGAAA
GCCAGTCCTC AACCCGAATA TGGACTGTTT AAACGCTTAG CCTCCCAAGG GGATACGGTT
ATTATTGATT CTAGTACCTT TAGCAGTCCT CAAGAAAACC TGTTAGAAAT CGGTCAGATG
GTTGAGCAGA AAATTCCCCT AGCGGACTTA AACTGGGCAA GAATTGGACC TTGGCAAGAG
TTAACCGCAG CCGCGTTTGA CCCCCCAGAA CGCCGCAGTG CGGTGCTAGA GGTAGACCGT
GTTACCATTG ATTATGAACG GGGCAATGCT TCCCAAGCGT TCATGTATTT GGGATGGGTA
GCCAGTCGTC TGCAATGGCG ACCTGTTGCC TATGAGTATG AGGGTGGAGA CTACGATATT
CGTCGGGTTA AGTTCCTCAA TAATGAACAA AAGACCATTG AAGCAGAGTT AGCAGGGGTT
CCTTTGGCAG ACTGGGGAGA CGTTTTAGGC GACTTAATTA GTCTTAAGTT GAGTTCGACT
AACCTCCAAG CTGACTGTTG TACCGTGTTG TGCTCAGGAA CCACTGGATG TATGCGAATG
GAAGCATCAG GAGGAGCCCA AGCTTGTCGT ATTGAACAAG TGACCTCTTT AGCCGATCAA
AATACGGAAT ACTTATTAGG AAGACAACTG CAACGGTGGG GCACTGATGC GCTTTATGAG
GAGAGTTTGA AGGTGACTAT GGCTATCCTT AAATTAGCCA GTAATGACTA A
 
Protein sequence
MTTQTPPIVS LQDPKDVSID VIEAELRAIW QNYSNNEDGI AATRATTFTF IVYEPEPTQY 
LLAVLGFYTG PVDGIAGPRT TAAIKSAQKA YNLEITGKSN EAFITTLQAE FEKAKAEGQL
TDAQQLLAKA YSPDLEGTGV ADSIAASNPC RIITLCPTIG EDEGVKAQVS AYCPINKRSH
NALICCEYIT LRGTAAALER IGGIISELVI TGLPTFVWWK ASPQPEYGLF KRLASQGDTV
IIDSSTFSSP QENLLEIGQM VEQKIPLADL NWARIGPWQE LTAAAFDPPE RRSAVLEVDR
VTIDYERGNA SQAFMYLGWV ASRLQWRPVA YEYEGGDYDI RRVKFLNNEQ KTIEAELAGV
PLADWGDVLG DLISLKLSST NLQADCCTVL CSGTTGCMRM EASGGAQACR IEQVTSLADQ
NTEYLLGRQL QRWGTDALYE ESLKVTMAIL KLASND