Gene P9301_11701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_11701 
Symbol 
ID4911953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp981428 
End bp982639 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content33% 
IMG OID640160756 
Productputative lycopene beta cyclase 
Protein accessionYP_001091394 
Protein GI126696508 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAATAC TTGATATTTT AATTTTAGGT TCAGGTCCTG CAGCATTATG TTTAGCCTCA 
GAATTAGCAA AGCAGGATCT TAAAATTAAA GGAATATCAA CAAAATCTCC AAATCAAAAA
TGGGAGAATA CATATGGTAT CTGGGCATCT GAATTAGAAG AATTAGGGTT AGAGAACTTG
TTATCTCATC GATGGTGTAA AACAGTTAGT TTTTTTGGAG ATGGGGAAAA TAAAAAAGGG
GATACTCCGA CAAAGCATAA CTACGATTAT GGTTTGATAA ATCAGGAAGC CTTTCAAAAT
GAGCTTTTAA AAAAATGTAA AGGGATTGAA TGGTTGAATG AAACAGCAAC AGACATTAAA
GAAAAAAATA AACTATCTGA GGTAATTTGT TTTTCAGGTC TCAAAATAAA GGCGAGATTA
GTTATTGATG CAAGTGGTCA TAAAAGTAAT TTTGTAAAAA GACCAGTTCA AAATGAAATC
GCTCAACAAG CTGCTTACGG AATTGTAGGT AAATTTACAT CACCACCTGT TAATAAAGAA
CAATTTGTCC TAATGGATTT TCGTCCAAAT CATTTAAACA ATGAAGAAAA GTTATCATCT
CCTTCCTTTC TTTATGCAAT GGATCTTGGC AACGAGACTT TTTTTGTTGA AGAAACTTCA
TTAGCTAGTT ACCCTGCATT AACCCAAGAA AATCTTAAAA AAAGACTTTA TAAAAGACTT
AAGAGCAAAG GTATTGAGGT AAGTGAAATT TTTCATGAAG AGAATTGCCT TTTCCCTATG
AATTTACCCC TCCCATTTAA AAAACAATTT GTACTTGGTT TCGGAGGGGC TGCAAGTATG
GTTCATCCTG CATCAGGATA CATGGTTGGA TCCTTATTAA GAAGGGCTCC ACTCCTTGCA
CAAAAATTAG CACTCTTTTT AAAAGAACCT CATCTTAGTT CACTAGAGTT AGCTTCAAAA
GGTTGGGAAA TCCTATGGCC TTACGAGTTA ACACAAAGGC ATAAACTTTA CCAATACGGT
CTAAGAAGAT TGATGAGTTT TGACGAAAGT AGATTAAGAA GCTTTTTCTC AAATTTCTTT
AGATTATCAA CCAATGAATG GGTAGGTTTT CTTACTAATA CACTTCCACT TCCAAAACTA
ATTTACGTGA TGAGTAAGAT GTTTATAAAT TCACCCCTAA AAGTAAAACT AGGGATGCTC
AAGTTAAATT AG
 
Protein sequence
MEILDILILG SGPAALCLAS ELAKQDLKIK GISTKSPNQK WENTYGIWAS ELEELGLENL 
LSHRWCKTVS FFGDGENKKG DTPTKHNYDY GLINQEAFQN ELLKKCKGIE WLNETATDIK
EKNKLSEVIC FSGLKIKARL VIDASGHKSN FVKRPVQNEI AQQAAYGIVG KFTSPPVNKE
QFVLMDFRPN HLNNEEKLSS PSFLYAMDLG NETFFVEETS LASYPALTQE NLKKRLYKRL
KSKGIEVSEI FHEENCLFPM NLPLPFKKQF VLGFGGAASM VHPASGYMVG SLLRRAPLLA
QKLALFLKEP HLSSLELASK GWEILWPYEL TQRHKLYQYG LRRLMSFDES RLRSFFSNFF
RLSTNEWVGF LTNTLPLPKL IYVMSKMFIN SPLKVKLGML KLN