Gene A9601_12521 details


Gene Information

Locus tag: A9601_12521
Symbol:
ID: 4717969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1058789
End bp: 1059871
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 44%
IMG OID: 640078971
Product: hypothetical protein
Protein accession: YP_001009643
Protein GI: 123968785
COG category:
COG ID:
TIGRFAM ID: [TIGR01151] photosystem II, D1 subunit (also called Q(B))
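
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only illustrative: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1058789, 1059871)
print(length)                     # 1083
print(protein_length_aa(length))  # 360
```

Both values agree with the Gene Length and Protein Length fields of the record.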


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACTA TTCAGCAGCA GCGTTCTTCG CTGTTAAAAG GTTGGCCACA GTTTTGTGAG 
TGGGTAACAT CAACTAACAA CAGAATTTAT GTTGGTTGGT TCGGCGTCTT AATGATTCCA
TGCCTTCTTA CAGCAGCGGC TTGCTTCATC GTTGCATTCA TCGCAGCACC ACCAGTAGAC
ATCGACGGAA TTAGAGAGCC AGTTGCTGGT TCATTCCTAT ATGGAAACAA CATCATCTCA
GGTGCAGTTG TTCCTTCATC TAACGCTATT GGTCTACACT TCTACCCAAT TTGGGAAGCA
GCTACTGTAG ATGAGTGGTT ATACAACGGT GGTCCTTACC AGCTTGTAAT TTTCCACTTC
CTAATTGGTA TCTCAGCATA CATGGGAAGA CAGTGGGAGC TTTCATACCG TTTAGGTATG
CGTCCTTGGA TCTGTGTTGC ATACTCTGCA CCAGTTTCAG CAGCTTTCGC AGTATTTCTT
GTATACCCAT TCGGTCAAGG TTCATTCTCT GACGGAATGC CTTTAGGTAT CTCTGGAACA
TTCAACTTCA TGTTTGTTTT CCAGGCAGAG CACAACATTC TTATGCACCC ATTCCACATG
GCTGGTGTTG CTGGTATGTT CGGAGGATCT TTATTCTCAG CTATGCACGG TTCACTTGTT
ACTTCATCTC TAATCAGAGA AACAACTGAG ACAGAGTCTC AGAACTATGG TTACAAGTTC
GGACAAGAAG AAGAAACATA TAACATCGTT GCAGCTCATG GCTACTTCGG TCGTTTGATC
TTCCAATATG CTTCATTCAA CAACAGCAGA AGTCTTCACT TCTTCCTAGC TGTATTCCCA
GTTGTTTGTG TATGGTTAAC TTCAATGGGT ATCTGCACAA TGGCATTCAA CCTTAACGGT
TTCAACTTCA ACCAGTCAGT TGTTGATGCA AACGGTAAGA TTGTTCCTAC ATGGGGTGAC
GTTCTTAACA GAGCAAACCT AGGTATGGAA GTAATGCACG AGCGTAACGC TCACAACTTC
CCACTTGATC TAGCAGCAGC TGAGTCTACA ACAGTAGCTC TTTCAGCTCC AGCTATCGGT
TAA
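
The gene sequence above is printed in blocks of 10 bases, so the spaces and line breaks need to be removed before any computation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be computed. The helper names are hypothetical, and the exact rounding used by the source database is not known, so no rounding is applied here.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks of the 10-base block layout."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence (60 bases).
first_line = "ATGACAACTA TTCAGCAGCA GCGTTCTTCG CTGTTAAAAG GTTGGCCACA GTTTTGTGAG"
print(len(clean_sequence(first_line)))
print(gc_percent(first_line))
```

Passing the complete gene sequence instead of the first line gives the value for the whole gene.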
 
Protein sequence
MTTIQQQRSS LLKGWPQFCE WVTSTNNRIY VGWFGVLMIP CLLTAAACFI VAFIAAPPVD 
IDGIREPVAG SFLYGNNIIS GAVVPSSNAI GLHFYPIWEA ATVDEWLYNG GPYQLVIFHF
LIGISAYMGR QWELSYRLGM RPWICVAYSA PVSAAFAVFL VYPFGQGSFS DGMPLGISGT
FNFMFVFQAE HNILMHPFHM AGVAGMFGGS LFSAMHGSLV TSSLIRETTE TESQNYGYKF
GQEEETYNIV AAHGYFGRLI FQYASFNNSR SLHFFLAVFP VVCVWLTSMG ICTMAFNLNG
FNFNQSVVDA NGKIVPTWGD VLNRANLGME VMHERNAHNF PLDLAAAEST TVALSAPAIG