Gene NATL1_15221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_15221 
Symbol 
ID4780699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1237265 
End bp1238509 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content37% 
IMG OID640084804 
Productputative lycopene beta cyclase 
Protein accessionYP_001015344 
Protein GI124026228 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID[TIGR01790] lycopene cyclase family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAATTGA ATAATGTCGC TGATGTATTA GTCATGGGAG CAGGACCTGC TGCTCTTTGC 
ATCGCTGCTG AGTTAGTTCA ACACGGACTT GATGTTCAGG CGATTGCTTC TAAGTCTCCT
TTAGAACCTT GGCCAAACAC TTATGGGATT TGGGCATCTG AACTTGAGTC TTTAAATATG
CAAGAACTAT TGAAATATAG ATGGGAAGAT ACTGTTAGCT TTTTTGGAGA TGGATTAGGT
GGAAAAGGTA ATATTTGTAC AAATCATTAT CTTGATTACG GCCTCTTTAA TTCAATTAAT
TTTCAGGAGG CTCTTCTTGA GAGATGTAAT GGACTTCCTT GGCAACTAGA AACTGTTGAT
AATATTGATT TTAGAGAAAG AGAGACCGTT GTTATTTGTA CTTCCGGAAA AAAATATTTT
GCTAGGCTCG TTATTGATGC AAGTGGTTAT AAAACCCCTT TTATCAGGAG GCCTAAGCAT
GATCAAATCG CTAAGCAAGC GGCATATGGG GTGGTTGGGA AATTTAGTTC TGCTCCTGTA
GAGAAAAATC GTTTTGTATT GATGGATTTT AGATCAGACC ATTTAAATGC CAACGAATTA
GAGGAGCCAC CTTCTTTCCT TTATGCTATG GACCTTGGAG ATGGTAGTTA TTTTGTAGAA
GAAACATCTT TGGCTTGTTC ACCTCCAATT TCATTTGAAT CATTAAAAGC AAGATTAAAT
TTACGACTAT CTAATAAAGG TATTCAAATA GACGAAATTT TCCATGAAGA ACATTGTCTT
TTTCCAATGA ACTTGCCATT GCCTTATAGA GATCAACCCC TTTTGGCCTT TGGAGGCTCG
GCCAGTATGG TTCATCCTGC TTCGGGATAT CTTGTTGGAT CCCTTTTAAG GAGAGCACCT
TCATTAGCAA GTGAAATAGC AAAAGTAATT AAAAAAGAAC CTCTTATGAC TACATCTCAG
ATAGCCATAA GAGGATGGAA AACCCTATGG ACAAATGAAT TAGTTCAAAG ACATCGTCTT
TATCAGTTTG GACTTCAAAG ACTAATGAGC TTTGACGAAA CTTTATTAAG ATCTTTTTTT
GATACTTTTT TTAAATTACC TAAAAAAGAT TGGTTCGGAT ATTTAACTAA TACGCTTCCT
TTGCCAAGAC TTTTTATTGT GATGCTCAAA CTATTTTACA TCGCCCCATC CAAGGTCAGG
TTGGGAATGA CTGGTTTACT TATAAACAAG CGTGAAAAAA CTTAG
 
Protein sequence
MKLNNVADVL VMGAGPAALC IAAELVQHGL DVQAIASKSP LEPWPNTYGI WASELESLNM 
QELLKYRWED TVSFFGDGLG GKGNICTNHY LDYGLFNSIN FQEALLERCN GLPWQLETVD
NIDFRERETV VICTSGKKYF ARLVIDASGY KTPFIRRPKH DQIAKQAAYG VVGKFSSAPV
EKNRFVLMDF RSDHLNANEL EEPPSFLYAM DLGDGSYFVE ETSLACSPPI SFESLKARLN
LRLSNKGIQI DEIFHEEHCL FPMNLPLPYR DQPLLAFGGS ASMVHPASGY LVGSLLRRAP
SLASEIAKVI KKEPLMTTSQ IAIRGWKTLW TNELVQRHRL YQFGLQRLMS FDETLLRSFF
DTFFKLPKKD WFGYLTNTLP LPRLFIVMLK LFYIAPSKVR LGMTGLLINK REKT