Gene NATL1_01891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01891 
SymbolcrtQ 
ID4779287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp176722 
End bp178182 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content40% 
IMG OID640083453 
Productzeta-carotene desaturase 
Protein accessionYP_001014018 
Protein GI124024902 
COG category[S] Function unknown 
COG ID[COG3349] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02732] carotene 7,8-desaturase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.905208 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.646889 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAATCG CAATAATAGG TGCAGGTTTA GCAGGATTGA CTGCTGCAGT TGACCTTGTT 
GATGAAGGAC ATGACGTCGA CCTATACGAA GCAAAACCTT TTATAGGAGG TAAAGTTGGA
AGCTGGGAAG ATTCCGATGG CAATCATATA GAGATGGGTT TGCATGTCTT CTTTTTCAAC
TATGCCAATC TGTTTTCATT AATGAGGAAA GTAGGCGCGT TTGAAAATCT TTTACCAAAA
GACCACACTC ACCTTTTTGT TAACAAGGGC GGTGACATTA AATCACTTGA TTTTCGATTT
TTTGCAGGAG CTCCTTTTAA TGGCTTAAAA GCCTTCTTCA CTACACCTCA ATTAAATTTG
ATTGACAAGC TAAGAAATGC TCTTGCTCTT GGCACAAGTC CGATAGTAAG AGGGCTGATT
GACTACGAGG GTGCGATGAA AACAATACGG TCACTAGATT CTATAAGCTT TCAAAAATGG
TTTCTTAACC ATGGAGGAAG CCTAAATAGT ATCGAAAGGA TGTGGAATCC GATTGCATAT
GCGCTGGGAT TCATTGATTG CGAAGCCATT TCTGCAAGAT GCATGCTTAC CATCTTTATG
ATGTTCGCTT CTAAGACAGA GGCCTCTAAG CTCAACCTAT TGAAGGGATC ACCCCACAAA
TGGCTCACTA AGCCAATACT CGATTACATT GAACAAAGAG GCGGAAAACT TCACTTAGAA
AATATTGTTA AAGAAATTCA TTCAGAGGAT TCTGATCATC CATCTGTGAC CGGAATAACC
CTTCAAACTC CCGAGGGTGA ACAAACAATT AAAGCCGACA AATATCTAGC TGCTTGCGAT
GTGTCTGGTA TAAAAAGAAT AATTCCTAGA TCATGGAGAC GTTTTAAAGA GTTCGACTCA
CTTTTTAAGC TTGATGCTGT TCCAGTAGCG ACAGTACAAC TTAGATACGA TGGCTGGGTA
ACGGAGATCA ATAATCAACA AGCTCAAAAA AACCTAGAAA GTCCATCTGG TTTAGACAAC
TTGTTATATA CAGCCGATGC TGATTTTAGT TGTTTTGCAG ATTTAGCTTT ATCAAGCCCA
GAAGACTATA AAAAAGAAGG TCAGGGCTCT CTACTTCAAT GTGTGTTAAC GCCTGGAGAT
CCATGGATTA CCAAGTCCTC TGATGAAATC GTAAAACATA CTGATTTACA GGTCAGGACA
CTTTTCCCTT CTTCAAGAGG TTTAAAGCTT TTGTGGAGCA ATGTGGTCAA GGTTTCTCAC
TCTCTCTACA GGGAGGCTCC TGGAATGGAG CCATATAGAC CAGATCAAAA AACTTCATTT
AGTAATTTTT TCTTAGCAGG CAGTTACACA AAACAGGACT ACATTGACTC TATGGAAGGA
GCAACAATGA GTGGACATCT TGCTGCTTCA GCAATGCTCT CAAAGTCTGT TTCACTAGCA
AAAAATTCTT CGGTTGCTTA A
 
Protein sequence
MKIAIIGAGL AGLTAAVDLV DEGHDVDLYE AKPFIGGKVG SWEDSDGNHI EMGLHVFFFN 
YANLFSLMRK VGAFENLLPK DHTHLFVNKG GDIKSLDFRF FAGAPFNGLK AFFTTPQLNL
IDKLRNALAL GTSPIVRGLI DYEGAMKTIR SLDSISFQKW FLNHGGSLNS IERMWNPIAY
ALGFIDCEAI SARCMLTIFM MFASKTEASK LNLLKGSPHK WLTKPILDYI EQRGGKLHLE
NIVKEIHSED SDHPSVTGIT LQTPEGEQTI KADKYLAACD VSGIKRIIPR SWRRFKEFDS
LFKLDAVPVA TVQLRYDGWV TEINNQQAQK NLESPSGLDN LLYTADADFS CFADLALSSP
EDYKKEGQGS LLQCVLTPGD PWITKSSDEI VKHTDLQVRT LFPSSRGLKL LWSNVVKVSH
SLYREAPGME PYRPDQKTSF SNFFLAGSYT KQDYIDSMEG ATMSGHLAAS AMLSKSVSLA
KNSSVA