Gene Syncc9902_1864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_1864 
Symbol 
ID3742177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1791891 
End bp1793450 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content56% 
IMG OID637772059 
Productphotosystem II chlorophyll-binding protein CP47 
Protein accessionYP_377865 
Protein GI78185430 
COG category 
COG ID 
TIGRFAM ID[TIGR03039] photosystem II chlorophyll-binding protein CP47 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGATTGC CCTGGTATCG GGTGCACACC GTCGTCATCA ATGACCCGGG TCGCCTTTTG 
GCGGTGCACC TCATGCATAC AGCCCTCGTT GCCGGCTGGG CCGGCTCCAT GGCTCTGTAC
GAATTAGCGA TTTTCGATCC ATCCGATGCT GTCCTGAACC CCATGTGGCG TCAGGGCATG
TTTGTGATGC CCTTCATGTC TCGCCTGGGA GTTACCGGGA GCTGGGGTGG TTGGAGCATC
ACTGGTGAAA CCGGGGTTGA CCCTGGCTTC TGGAGCTTCG AAGGTGTTGC CGCCGCCCAC
ATTATTTTTT CAGGCCTGCT GATGCTGGCC GCCATCTGGC ACTGGACCTA CTGGGATCTT
GAGATCTGGC AGGACCCCAG AACCGGAGAA CCAGCCCTTG ATCTTCCAAA GATTTTTGGC
ATTCACCTTC TACTAGCTGG CCTTGGCTGC TTTGGATTCG GTGCTTTCCA TCTCACTGGT
GTTTTTGGGC CTGGCATGTG GATTTCTGAT CCATATGGAA TTACTGGTCA CCTAGAGGCT
GTACAACCGT CTTGGGGTCC GGAAGGATTC AATCCGTTTA ACCCCGGTGG GATCGTTGCC
CACCACATTG CAGCTGGAAT TGTCGGCATC ATCGCTGGCA TTTTCCACAT CACCACGCGA
CCGCCCGAGC GCCTCTACAA GGCTCTCCGG ATGGGCAACA TTGAAACTGT CTTAGCGAGT
GCAATCGCGG CTGTTTTCTT TGCTGCTTTC ATCGTTGCTG GAACCATGTG GTACGGCTCA
GCTGCTACCC CAGTCGAGCT TTTCGGCCCT ACTCGTTATC AGTGGGATCA GAGCTACTTC
AAGACGGAGA TCAACCGTCG CGTTCAAACC GCTATGGACG ACGGTGCAAG CCGTGAAGAA
GCGTTTGCAG CCATTCCAGA GAAACTGGCG TTCTACGACT ACGTGGGCAA CAGCCCTGCC
AAGGGTGGAT TGTTCCGAGT TGGCCCGATG GTGAACGGTG ACGGCCTTGC CACTTCATGG
CTGGGTCACG TTGTGTTCAC CGACAGCAAT GGTCGTGAGT TGCAAGTTCG TCGTCTGCCG
AATTTCTTCG AGAACTTCCC AGTGATTCTG GAAGACGAGC AAGGCATTGT TCGTGCTGAC
ATTCCTTTCC GCCGTGCGGA AGCGAAGTAT TCCTTCGAAC AACAAGGCGT TACTGCTCAG
GTGTTTGGCG GAGCCTTGGA TGGTCAGAAG TTCACGGATC CTGCTGATGT GAAACGTTTG
GCCCGCAAGT CGCAGTTGGG AGAAGCGTTC GATTTCGATC GCGAAACCTA CAACTCTGAT
GGTGTCTTCC GCAGTTCACC TCGCGGTTGG TTCACGTTCG GCCACGCCAC CTTCGCGCTG
CTCTTCTTCT TTGGACACAT TTGGCATGGG GCACGCACCC TGTACCGCGA TGTGTTTGCT
GGTATCGATC CAGACCTTGG AGACCAGGTG GAATTCGGCC TGTTCGCCAA GCTGGGCGAC
AAGACCACAC GTCGTCTTCC AGAGGGCTAC GTGCCCCCTG CAGGAACGCC TCTCAACTGA
 
Protein sequence
MGLPWYRVHT VVINDPGRLL AVHLMHTALV AGWAGSMALY ELAIFDPSDA VLNPMWRQGM 
FVMPFMSRLG VTGSWGGWSI TGETGVDPGF WSFEGVAAAH IIFSGLLMLA AIWHWTYWDL
EIWQDPRTGE PALDLPKIFG IHLLLAGLGC FGFGAFHLTG VFGPGMWISD PYGITGHLEA
VQPSWGPEGF NPFNPGGIVA HHIAAGIVGI IAGIFHITTR PPERLYKALR MGNIETVLAS
AIAAVFFAAF IVAGTMWYGS AATPVELFGP TRYQWDQSYF KTEINRRVQT AMDDGASREE
AFAAIPEKLA FYDYVGNSPA KGGLFRVGPM VNGDGLATSW LGHVVFTDSN GRELQVRRLP
NFFENFPVIL EDEQGIVRAD IPFRRAEAKY SFEQQGVTAQ VFGGALDGQK FTDPADVKRL
ARKSQLGEAF DFDRETYNSD GVFRSSPRGW FTFGHATFAL LFFFGHIWHG ARTLYRDVFA
GIDPDLGDQV EFGLFAKLGD KTTRRLPEGY VPPAGTPLN