Gene Cyan8802_3051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3051 
Symbol 
ID8392381 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3084580 
End bp3086106 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content53% 
IMG OID644981000 
Productphotosystem II chlorophyll-binding protein CP47 
Protein accessionYP_003138732 
Protein GI257060844 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000997071 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.381945 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTAC CTTGGTATCG AGTTCACACA GTTGTCCTGA ATGATCCAGG CCGACTTATT 
TCCGTTCACC TCATGCACAC CGCCCTTGTT GCGGGTTGGG CAGGTTCCAT GGCTCTGTAT
GAGCTAGCTA TTTTTGATCC GAGTGATCCC GTTCTCAACC CCATGTGGCG ACAAGGGATG
TTCGTCCTTC CCTTCATGGC CCGCTTAGGA GTCACTGGCT CCTGGGGTGG CTGGAGTGTC
ACCGGAGAAA CAGGTGTAAA CCCTGGTTTC TGGTCCTTTG AAGGCGTTGC TGCCGCCCAC
ATCGTTCTCT CTGGGTTACT CTTCCTAGCT GCCGTTTGGC ACTGGGTTTT CTGGGATCTC
GAACTCTTCG TTGATGCCCG TACTGGCGAA CCCGCCCTCG ACTTACCTAA GATGTTCGGG
ATTCACCTGT TCTTATCTGG GTTACTCTGC TTCGGTTTCG GAGCCTTCCA TCTTACCGGC
CTCTGGGGAC CGGGGATGTG GGTATCTGAC CCCTACGGCT TAACCGGCCA TGTCCAACCC
GTTGCCCCAG AATGGGGTCC GGCCGGGTTT AACCCCTTCA ACCCAGGGGG AGTTGTGGCT
CACCACATTG CAGCCGGAAT TGTGGGCATT ATTGCGGGTC TATTCCACCT AACGGTACGA
CCCCCCGAAC GGCTCTATAA AGCCCTCAGA ATGGGAAATA TTGAAACCGT TCTTTCTAGC
AGTATTGCCG CCGTCTTCTT TGCGGCCTTT GTCGTTGCTG GAACGATGTG GTACGGTAAC
GCAACCACCC CCATTGAACT GTTCGGACCG ACCCGTTATC AATGGGATAA TGGCTACTTC
AAACAAGAAA TTGAACGTCG TGTTGAAGCC AATGTAGCGG CGGGCGATAC TTTAGGGGAA
GCTTGGTCTA AAATTCCCGA AAAACTCGCC TTTTACGACT ATGTTGGCAA CAGCCCCGCA
AAAGGCGGGT TATTCCGTAC CGGAGCCATG GATAGTGGCG ATGGTATCGC CCAAGCTTGG
TTAGGTCATC CTGTCTTTAC GGACAAAGAC GGTCGGGAGT TAACCGTACG TCGGATGCCT
AACTTCTTTG AAACTTTCCC CATCGTTCTA ACCGATGCTG ATGGAGTCGT CCGTGCTGAC
ATTCCCTTCC GTCGGGCAGA ATCTAAACTG AGTATTGAGC AAAGCGGTGT TACCGTTAGC
TTCTATGGTG GTGCGCTTGA TGGCCAAAGC TTCAGCAACC CCGCCCAAGT TAAACAGTTT
GCCCGCCAAG CCCAATTAGG CGAACCCTTC GAGTTTGACC GCGAAACCCT CGGTTCTGAT
GGGGTATTCC GTACCAGTCC TCGCGGTTGG TTTACCTTCG GACACGCCGT CTTCGCCCTA
CTGTTCTTCT TTGGTCATAT TTGGCATGGT TCTCGTACCC TGTACCGAGA TGTCTTCGCT
GGAATTGACC CCGACCTAGA GGAACAAGTG GAATTTGGCT TGTTTGCTAA GGTGGGTGAC
TTAAGTACCC GTCGTACCGA GTCTTAA
 
Protein sequence
MGLPWYRVHT VVLNDPGRLI SVHLMHTALV AGWAGSMALY ELAIFDPSDP VLNPMWRQGM 
FVLPFMARLG VTGSWGGWSV TGETGVNPGF WSFEGVAAAH IVLSGLLFLA AVWHWVFWDL
ELFVDARTGE PALDLPKMFG IHLFLSGLLC FGFGAFHLTG LWGPGMWVSD PYGLTGHVQP
VAPEWGPAGF NPFNPGGVVA HHIAAGIVGI IAGLFHLTVR PPERLYKALR MGNIETVLSS
SIAAVFFAAF VVAGTMWYGN ATTPIELFGP TRYQWDNGYF KQEIERRVEA NVAAGDTLGE
AWSKIPEKLA FYDYVGNSPA KGGLFRTGAM DSGDGIAQAW LGHPVFTDKD GRELTVRRMP
NFFETFPIVL TDADGVVRAD IPFRRAESKL SIEQSGVTVS FYGGALDGQS FSNPAQVKQF
ARQAQLGEPF EFDRETLGSD GVFRTSPRGW FTFGHAVFAL LFFFGHIWHG SRTLYRDVFA
GIDPDLEEQV EFGLFAKVGD LSTRRTES