Gene Syncc9605_0470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_0470 
Symbol 
ID3737673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp459125 
End bp460684 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content60% 
IMG OID637775060 
Productphotosystem II chlorophyll-binding protein CP47 
Protein accessionYP_380799 
Protein GI78212020 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATTGC CCTGGTATCG GGTGCACACC GTTGTCATTA ATGACCCGGG CCGCCTTCTG 
GCTGTGCACC TCATGCATAC AGCCCTCGTA GCCGGCTGGG CCGGCTCCAT GGCTCTCTAC
GAGCTGGCCA TTTTCGATCC GTCTGACCCT GTCCTGAACC CCATGTGGCG TCAGGGCATG
TTCGTGATGC CTTTCATGTC CCGCCTTGGC GTGACCGGCA GCTGGGGTGG ATGGAGCATC
ACCGGCGAAA CGGGTGTTGA TCCCGGTTTC TGGAGTTTTG AGGGCGTTGC TGCCGCTCAC
ATAGTTTTCT CCGGCCTGAT GATGCTGGCC GCCATCTGGC ACTGGACTTA TTGGGATCTT
GAGATCTGGC AGGACCCCCG CACTGGCGAA CCCGCCCTCG ACCTGCCGAA GATCTTCGGC
ATCCACCTGC TTCTCGCAGG ACTCGGCTGC TTCGGTTTCG GTGCTTTCCA CCTCACTGGC
GTCTTCGGGC CAGGCATGTG GATTTCTGAC CCCTATGCAT TAACTGGTCA TCTCGAGGCG
GTTCAACCGT CTTGGGGGCC TGAAGGTTTC AACCCCTTCA ACCCCGGTGG CATCGTTGCC
CACCACATTG CCGCCGGCAT CGTCGGCATC ATTGCTGGCA TTTTCCACAT CACCACGCGA
CCGCCCGAGC GCCTCTACAA AGCGCTTCGG ATGGGCAACA TCGAAACGGT TCTGGCCAGC
GCCATCGCAG CCGTGTTCTT CGCAGCCTTC ATCGTGGCTG GAACCATGTG GTACGGCTCT
GCCGCGACCC CCGTCGAGCT GTTTGGCCCC ACCCGTTATC AGTGGGATCA GAACTACTTC
AAAACTGAGA TCAATCGTCG GGTTCAAACC GCGATGGATG ATGGTGCCAC CCAGGAAGAA
GCCTTCGAGG CCATCCCTGA GAAGCTCGCT TTTTATGACT ATGTTGGCAA CAGCCCCGCC
AAAGGTGGTC TGTTCCGCGT GGGTCCGATG GTGAACGGCG ATGGTTTGGC AACCGCCTGG
GTTGGTCACA TCGCATTCAG TGACAATGAA GGTCGCAACC TCGAAGTCCG TCGCCTGCCG
AACTTCTTCG AGAACTTCCC CGTCGTTCTG GAAGACGAGC AGGGCATCGT TCGTGCAGAC
ATTCCCTACC GTCGCGCAGA AGCCAAGTTC TCCTTCGAAC AACAAGGCGT GACCGCCAAG
GTGTTCGGTG GCGCACTTGA CGGCCAGACC TTCACTGACC CTGCCGACGT AAAGCGCCTT
GCCCGTAAGG CACAGCTGGG TGAAGCCTTC GACTTCGACC GTGAGACCTA CAACTCTGAC
GGCACGTTCC GCAGCTCGCC ACGCGGCTGG TTCACCTTTG GCCACGCCAC CTTCGCGCTG
CTGTTCTTCT TCGGTCACAT CTGGCACGGT GCCCGCACCC TGTACCGCGA CGTTTTCGCT
GGTATTGATC CCGACCTCGG AGATCAGGTG GAGTTTGGCC TCTTCGCCAA GCTCGGTGAC
AAAACCACCC GTCGCCTGCC CGAGGGCTAC GTTCCCCCCG CAGGAACTCC TCTCAACTGA
 
Protein sequence
MGLPWYRVHT VVINDPGRLL AVHLMHTALV AGWAGSMALY ELAIFDPSDP VLNPMWRQGM 
FVMPFMSRLG VTGSWGGWSI TGETGVDPGF WSFEGVAAAH IVFSGLMMLA AIWHWTYWDL
EIWQDPRTGE PALDLPKIFG IHLLLAGLGC FGFGAFHLTG VFGPGMWISD PYALTGHLEA
VQPSWGPEGF NPFNPGGIVA HHIAAGIVGI IAGIFHITTR PPERLYKALR MGNIETVLAS
AIAAVFFAAF IVAGTMWYGS AATPVELFGP TRYQWDQNYF KTEINRRVQT AMDDGATQEE
AFEAIPEKLA FYDYVGNSPA KGGLFRVGPM VNGDGLATAW VGHIAFSDNE GRNLEVRRLP
NFFENFPVVL EDEQGIVRAD IPYRRAEAKF SFEQQGVTAK VFGGALDGQT FTDPADVKRL
ARKAQLGEAF DFDRETYNSD GTFRSSPRGW FTFGHATFAL LFFFGHIWHG ARTLYRDVFA
GIDPDLGDQV EFGLFAKLGD KTTRRLPEGY VPPAGTPLN