Gene NATL1_14681 details

Gene Information

Locus tag: NATL1_14681
Symbol: crtH
ID: 4780674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1175222
End bp: 1176784
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 37%
IMG OID: 640084749
Product: putative carotenoid isomerase
Protein accession: YP_001015290
Protein GI: 124026174
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02730] carotene isomerase
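
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the reported gene length includes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1175222
end_bp = 1176784
gene_length_bp = 1563
protein_length_aa = 520

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons. Assumption: the final codon is the
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # True
print(remainder == 0)                                # True
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1563 bases form 521 codons, of which 520 encode amino acids.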


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.314761
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.880307
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTCAAT CATCAAATAA AGCCAACGAT TCTTGCAAGT GGGATTCAAT AGTTATTGGT 
TCCGGTCTGG GTGGATTAGT AACTGCAAGT CAACTAGCAA GCAAAGGTGC AAAGGTGCTT
GTTCTAGAAC AGTATAAGAT TCCGGGTGGA AGTGGTGGAT CATTTAAAAG AAAAGGATTT
ACTTTTGATG TTGGAGCATC AATGATCTTT GGGTTTGGTG ATAAGGGATA CACAAATTTA
CTAACGAGAG CTCTTAAAGA TGTTGGACAA AAATGTGAAA CGATTCCTGA CCCAACACAA
TTGGCATATC ATCTACCCAA TCAATTAGAA ATTTCAGTTG ATAGAGATTA TCAAAAATTT
ATTACCAAGC TAATAAATTT ATTTCCGCAT GAAGAGAAAG GAATAAAACA TTTTTACGAA
ACATGTTGGG ATGTATTTAA TTGTCTAGAT TCAATGCCTT TACTTTCAAT AGAGGATCCT
GCGTATCTAA TGAAAGTTTT TTTTAAATCA CCTTTATCAT GTCTTGGACT GGCTAGGTGG
TTGCCAGTTA ATGCTGGTGA TGTTGCTAGG CGATATATTA AAGATCCTGA TCTTTTAAGG
TTTATAGATA TTGAATGTTT CTGCTGGTCA GTTATGCCTG CCAACTTGAC TCCTATGATA
AATGCAGGAA TGGTCTTTTC TGATAGACAT TATGGAGGTA TTAACTACCC CAAAGGAGGT
GTAGGAATTA TTGCAGAAAA GCTAGTGAAG GGAATTAAAA ATCTGGGGGG GGAAGTTAGA
TATAAATCCA GAGTAAAAAA AATAATTTTG GAAAACGGAA ATGCCATAGG TGTTTCACTT
GAAAATGATG AAGAAATTTT CAGTAAAACA GTGGTCTCTA ATGCTACAAG ATGGGATACT
TTTGGTGGTC AAGGGATTAA GAGTCCACTT ATACCAACAG AGAACAAACC AACGTCTGAG
AAAAAATGGG AAAACAGATA TATACCTTCC CCTTCATTCT TATCTCTACA TTTGGGAGTG
AATGAAGACT CTATACATGC TAACACTCAT TGCCATCATC TACTTCTCGA CAAATGGGAA
GAAATGGAGA AAGAACAGGG AGTTACTTTC GTATCAATAC CAACTCTCTT AGATCCCACA
TTGGCACCTT CAGGTAGCCA TATTGTTCAT GCTTTTACTC CTTCTTCAAT AGATAACTGG
GATGGATTAA GCAATAAAGA ATATCTTACA AAGAAAAAAG AAGATGGTGA TAAACTTATT
TCAAAGTTAG AGAGATTATT TCCTAACCTA AGTCAAAATA TTCTTCACAA AGAAATAGGA
AGTCCAAGAA CTCATAAAAG ATTCCTTGCA AGAAACAAAG GAAGTTATGG ACCAATTCCA
TCAATGAGAT TGCCGGGCCT TCTCCCCATG CCATTCAACA CTACTAAAAT TAATGGACTT
TATTGTGTTG GGGACTCGTG CTTCCCCGGT CAAGGTTTGA ATGCCGTGGC TTTCAGTGGT
TATGCCTGTG CTCACAAGAT AGGCACAAAG CTTGGTATGA ACAAATGGGA GCTACCAGAA
TAA
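
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. As a minimal sketch, the hypothetical helper below (`gc_percent` is not part of the record) strips the whitespace and computes the percentage of G and C bases, which can then be compared with the GC content field reported for this gene.

```python
def gc_percent(raw_sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (spaces and line breaks from the blocked layout) is ignored.
    """
    seq = "".join(raw_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full gene sequence in place of this first line of it.
first_line = "ATGACTCAAT CATCAAATAA AGCCAACGAT TCTTGCAAGT GGGATTCAAT AGTTATTGGT"
print(round(gc_percent(first_line), 1))
```

A value computed from a single line will differ from the whole-gene figure; only the full sequence is comparable to the reported percentage.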
 
Protein sequence
MTQSSNKAND SCKWDSIVIG SGLGGLVTAS QLASKGAKVL VLEQYKIPGG SGGSFKRKGF 
TFDVGASMIF GFGDKGYTNL LTRALKDVGQ KCETIPDPTQ LAYHLPNQLE ISVDRDYQKF
ITKLINLFPH EEKGIKHFYE TCWDVFNCLD SMPLLSIEDP AYLMKVFFKS PLSCLGLARW
LPVNAGDVAR RYIKDPDLLR FIDIECFCWS VMPANLTPMI NAGMVFSDRH YGGINYPKGG
VGIIAEKLVK GIKNLGGEVR YKSRVKKIIL ENGNAIGVSL ENDEEIFSKT VVSNATRWDT
FGGQGIKSPL IPTENKPTSE KKWENRYIPS PSFLSLHLGV NEDSIHANTH CHHLLLDKWE
EMEKEQGVTF VSIPTLLDPT LAPSGSHIVH AFTPSSIDNW DGLSNKEYLT KKKEDGDKLI
SKLERLFPNL SQNILHKEIG SPRTHKRFLA RNKGSYGPIP SMRLPGLLPM PFNTTKINGL
YCVGDSCFPG QGLNAVAFSG YACAHKIGTK LGMNKWELPE
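
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a simplified sketch of that step, not the database's own procedure. It relies on table 11 sharing the standard codon-to-amino-acid assignments, and it does not handle the alternative start codons that table 11 permits, which is safe here only because this gene begins with ATG. The name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
# Table 11 uses the same assignments as the standard code; "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw_sequence: str) -> str:
    """Translate a coding sequence, dropping a single trailing stop codon."""
    seq = "".join(raw_sequence.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# Usage: the first 30 bases of the gene followed by its TAA stop codon.
print(translate_cds("ATGACTCAAT CATCAAATAA AGCCAACGAT TAA"))  # MTQSSNKAND
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence should yield the complete protein under the same assumptions.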