Gene PCC8801_3069 details

Gene Information

Locus tag: PCC8801_3069
Symbol:
ID: 7104545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 3211838
End bp: 3213364
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 53%
IMG OID: 643476093
Product: photosystem II chlorophyll-binding protein CP47
Protein accession: YP_002373206
Protein GI: 218247835
COG category:
COG ID:
TIGRFAM ID: [TIGR03039] photosystem II chlorophyll-binding protein CP47
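
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3211838
end_bp = 3213364

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1527 508
```

Both results agree with the Gene Length and Protein Length fields listed in the record.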


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGACTAC CTTGGTATCG AGTTCACACA GTTGTCCTGA ATGATCCAGG CCGACTTATT 
TCCGTTCACC TCATGCACAC CGCCCTTGTT GCGGGTTGGG CAGGTTCCAT GGCTCTGTAC
GAGCTAGCTA TTTTTGATCC GAGTGATCCC GTTCTCAACC CCATGTGGCG ACAAGGGATG
TTCGTCCTTC CCTTCATGGC CCGCTTAGGA GTCACTGGCT CCTGGGGTGG CTGGAGTGTC
ACCGGAGAAA CAGGTGTAAA CCCTGGTTTC TGGTCCTTTG AAGGCGTTGC TGCCGCCCAC
ATCGTTCTCT CTGGGTTACT CTTCCTAGCT GCCGTTTGGC ACTGGGTTTT CTGGGATCTC
GAACTCTTTG TTGATGCCCG TACTGGCGAA CCCGCCCTCG ACTTACCTAA GATGTTCGGG
ATTCACCTGT TCTTATCTGG GTTACTCTGC TTCGGTTTCG GAGCCTTCCA CCTCACCGGA
CTCTGGGGAC CGGGGATGTG GGTATCTGAC CCCTACGGCT TAACCGGCCA TGTCCAACCC
GTTGCCCCAG AATGGGGTCC GGCCGGGTTT AACCCCTTCA ACCCAGGGGG AGTTGTGGCT
CACCACATTG CAGCCGGAAT TGTGGGCATT ATTGCGGGTC TATTCCACCT AACGGTACGA
CCCCCCGAAC GGCTCTATAA AGCCCTCAGA ATGGGGAATA TTGAAACCGT TCTCTCTAGC
AGTATTGCCG CCGTCTTCTT TGCGGCCTTT GTCGTTGCTG GAACGATGTG GTACGGTAAC
GCAACCACCC CCATTGAACT GTTCGGACCG ACCCGTTATC AATGGGATAA TGGCTACTTC
AAACAAGAAA TTGAACGTCG TGTTGAAGCC AATGTAGCGG CGGGCGATAC TTTAGGGGAA
GCTTGGTCTA AAATTCCCGA AAAACTTGCC TTTTACGACT ATGTTGGCAA CAGCCCCGCA
AAAGGCGGTT TATTCCGTAC CGGAGCCATG GATAGTGGCG ATGGTATCGC CCAAGCTTGG
TTAGGTCATC CTGTCTTTAC GGACAAAGAC GGTCGGGAGT TAACCGTACG TCGGATGCCT
AACTTCTTTG AAACTTTCCC CATCGTTCTA ACCGATGCTG ATGGAGTCGT CCGTGCTGAC
ATTCCCTTCC GTCGGGCAGA ATCTAAACTG AGTATTGAGC AAAGCGGTGT TACCGTTAGC
TTCTATGGTG GTGCGCTTGA TGGCCAAAGC TTCAGCAACC CCGCTCAGGT TAAACAGTTT
GCTCGTCAAG CCCAATTAGG CGAACCCTTC GAGTTTGACC GCGAAACCCT CGGTTCTGAT
GGGGTATTCC GTACCAGTCC TCGCGGTTGG TTTACCTTCG GACACGCCGT CTTCGCCCTA
CTGTTCTTCT TTGGTCATAT TTGGCATGGT TCTCGTACCC TGTACCGAGA TGTCTTCGCT
GGAATTGACC CCGACCTAGA GGAACAAGTG GAATTTGGCT TGTTTGCTAA GGTGGGTGAC
TTGAGTACCC GTCGTACCGA GTCTTAA
 
Protein sequence
MGLPWYRVHT VVLNDPGRLI SVHLMHTALV AGWAGSMALY ELAIFDPSDP VLNPMWRQGM 
FVLPFMARLG VTGSWGGWSV TGETGVNPGF WSFEGVAAAH IVLSGLLFLA AVWHWVFWDL
ELFVDARTGE PALDLPKMFG IHLFLSGLLC FGFGAFHLTG LWGPGMWVSD PYGLTGHVQP
VAPEWGPAGF NPFNPGGVVA HHIAAGIVGI IAGLFHLTVR PPERLYKALR MGNIETVLSS
SIAAVFFAAF VVAGTMWYGN ATTPIELFGP TRYQWDNGYF KQEIERRVEA NVAAGDTLGE
AWSKIPEKLA FYDYVGNSPA KGGLFRTGAM DSGDGIAQAW LGHPVFTDKD GRELTVRRMP
NFFETFPIVL TDADGVVRAD IPFRRAESKL SIEQSGVTVS FYGGALDGQS FSNPAQVKQF
ARQAQLGEPF EFDRETLGSD GVFRTSPRGW FTFGHAVFAL LFFFGHIWHG SRTLYRDVFA
GIDPDLEEQV EFGLFAKVGD LSTRRTES
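
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The sketch below shows one way to strip the spacing, compute a GC fraction, and translate the gene sequence into protein. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and chosen only for illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11
# uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGGACTAC CTTGGTATCG AGTTCACACA"
print(translate(first_blocks))  # prints: MGLPWYRVHT
```

The printed peptide matches the first ten residues of the protein sequence in the record. Pasting the full gene sequence into `translate` or `gc_fraction` works the same way, since `clean` discards the block spacing and line breaks.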