Gene A9601_11691 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_11691 
Symbol 
ID4717882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp982555 
End bp983766 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content33% 
IMG OID640078884 
Productputative lycopene beta cyclase 
Protein accessionYP_001009560 
Protein GI123968702 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.451912 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATAC TTGATATTTT AATTTTAGGT TCCGGGCCTG CAGCACTGTG CTTAGCTTCA 
GAATTAGCCA AGCAAGATCT TAGTATAAAG GGAATATCAA CTAAATCTCC AAATGAAAAA
TGGGAGAATA CATATGGAAT TTGGGCATCT GAATTAGAGG AATTAGGATT AGAGTCATTG
TTATCTCATC GGTGGTGTAA AACAGTTAGT TTTTTTGGAA ACGGGGAAAA TAAAAAGGGA
GATAATCCAA CAAAACATAA TTATGATTAT GGTTTAATAA ATCAAGAAGC TTTTCAAAAT
GAACTTTTAA AAAAGTGCAA AGGGATTGAA TGGTTGAATG AAACGGCAAA AGATATTAAA
GAGAAAAATA AAATATCTGA GGTTATTTGT TCTTCAGGAC TAAGAATAAA GGCGAGGTTA
GTCATTGACG CAAGTGGTCA TAAGAGTAAT TTTGTAAAAA GACCCGTACA AAATGAAATC
GCTCAACAAG CTGCATATGG AATTGTCGGT AAATTTTCAT CCCCACCAGT CAAAAAAGAA
CAGTTTGTTT TAATGGATTT TCGTCCAAAT CATTTAAACA ATGAAGAAAA GTTATCATCA
CCATCCTTTC TCTATGCAAT GGATCTTGGA AATGAAACTT TTTTTGTTGA GGAAACATCA
TTAGCTAGTT ATCCTGCACT ATCCCAAGAT AATCTAAAAA AAAGACTTTT CAAAAGACTT
AATAATAAGG GTATTGAGGT GAGTGAAGTT TTTCATGAAG AGAATTGCCT TTTTCCAATG
AATTTACCCC TCCCATTTAA AAAACAATTT GTTCTTGGTT TTGGAGGTTC AGCAAGCATG
GTGCATCCTG CATCAGGATA CATGATCGGA TCTTTATTAA GGAGAGCTCC ACTACTCGCA
GAAAAATTGG CGATCTTTTT AAAAGAACCT AATCTAAGTT CTCTTGAACT AGCGACAAAA
GGATGGGGGG TCCTTTGGCC TTACGAGTTA ACACAAAGGC ATAAACTTTA CCAATATGGT
TTAAGAAGAT TGATGAGTTT TGACGAAAGT AAATTAAGAA GCTTTTTCTC AAATTTCTTT
AAATTATCGA CCAATGAATG GGTAGGATTT CTTACTAATA CACTTCCTCT TCCAAAACTT
ATTTATGTGA TGAGTAAAAT GTTTATAAAT TCACCTCTAA AGGTAAAACT AGGAATGCTT
AAATTAAATT AG
 
Protein sequence
MEILDILILG SGPAALCLAS ELAKQDLSIK GISTKSPNEK WENTYGIWAS ELEELGLESL 
LSHRWCKTVS FFGNGENKKG DNPTKHNYDY GLINQEAFQN ELLKKCKGIE WLNETAKDIK
EKNKISEVIC SSGLRIKARL VIDASGHKSN FVKRPVQNEI AQQAAYGIVG KFSSPPVKKE
QFVLMDFRPN HLNNEEKLSS PSFLYAMDLG NETFFVEETS LASYPALSQD NLKKRLFKRL
NNKGIEVSEV FHEENCLFPM NLPLPFKKQF VLGFGGSASM VHPASGYMIG SLLRRAPLLA
EKLAIFLKEP NLSSLELATK GWGVLWPYEL TQRHKLYQYG LRRLMSFDES KLRSFFSNFF
KLSTNEWVGF LTNTLPLPKL IYVMSKMFIN SPLKVKLGML KLN