Gene Syncc9902_2036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_2036 
Symbol 
ID3742996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1945429 
End bp1946508 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content58% 
IMG OID637772233 
Productphotosystem II reaction centre protein PsbA/D1 
Protein accessionYP_378037 
Protein GI78185603 
COG category 
COG ID 
TIGRFAM ID[TIGR01151] photosystem II, DI subunit (also called Q(B)) 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA CCATCCAGCA GCGCTCCGGC GCTTCTAGCT GGCAGTCCTT CTGCGAGTGG 
GTCACCTCCA CCAACAACCG TCTGTATGTC GGTTGGTTCG GTGTGCTGAT GATCCCAACT
CTGTTGGCTG CCACCATCTG CTTCGTCATC GCATTCGTCG CCGCTCCTCC GGTTGACATC
GATGGCATCC GCGAGCCTGT CGCTGGCTCC TTGATGTACG GCAACAACAT CATCTCTGGT
GCTGTTGTTC CTTCCAGCAA CGCCATTGGC TTGCACTTCT ATCCCATCTG GGAAGCAGCT
TCACTCGACG AGTGGCTGTA CAACGGCGGT CCTTTCCAGC TCGTCGTCTT CCACTTCCTC
ATCGGCATCT ACGCCTACAT GGGTCGTGAG TGGGAACTCT CTTACCGCTT GGGCATGCGC
CCTTGGATCT GTGTTGCATA CAGCGCACCT GTCGCTGCTG CATCTGCAGT CTTCCTCGTC
TACCCCTTCG GTCAGGGTTC GTTCTCTGAT GCAATGCCCC TGGGCATCTC TGGAACCTTC
AACTACATGT TGGTGTTCCA GGCTGAGCAC AACATCCTGA TGCACCCCTT CCACATGCTG
GGTGTTGCAG GCGTCTTCGG CGGCAGCTTG TTCTCCGCCA TGCACGGCTC ACTGGTGACC
TCCTCCTTGG TGCGTGAAAC CACCGAAAGC GAGTCCCAGA ACTACGGCTA CAAATTCGGC
CAAGAAGAAG AGACGTACAA CATCGTGGCT GCTCACGGCT ACTTCGGTCG CCTGATCTTC
CAATACGCCT CCTTCAACAA CAGCCGTAGC CTCCACTTCT TCCTGGCTGC CTGGCCCGTT
GTCGGCATCT GGTTCACCGC CCTTGGCGTG TCAACCATGG CCTTCAACCT GAACGGCTTC
AACTTCAACC AGTCCATCCT TGATGGTCAG GGCCGCGTCC TGAACACCTG GGCCGACGTG
TTGAACCGTG CAGGCCTCGG CATGGAAGTC ATGCACGAGC GCAACGCTCA CAACTTCCCC
CTCGACCTGG CAGCTGCTGA GTCCACACCT GTGGCACTGC AAGCACCTGC AATCGGTTGA
 
Protein sequence
MTTTIQQRSG ASSWQSFCEW VTSTNNRLYV GWFGVLMIPT LLAATICFVI AFVAAPPVDI 
DGIREPVAGS LMYGNNIISG AVVPSSNAIG LHFYPIWEAA SLDEWLYNGG PFQLVVFHFL
IGIYAYMGRE WELSYRLGMR PWICVAYSAP VAAASAVFLV YPFGQGSFSD AMPLGISGTF
NYMLVFQAEH NILMHPFHML GVAGVFGGSL FSAMHGSLVT SSLVRETTES ESQNYGYKFG
QEEETYNIVA AHGYFGRLIF QYASFNNSRS LHFFLAAWPV VGIWFTALGV STMAFNLNGF
NFNQSILDGQ GRVLNTWADV LNRAGLGMEV MHERNAHNFP LDLAAAESTP VALQAPAIG