Gene NATL1_20111 details


Gene Information

Locus tag: NATL1_20111
Symbol:
ID: 4779537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1654438
End bp: 1655955
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 38%
IMG OID: 640085303
Product: phytoene dehydrogenase and related proteins
Protein accession: YP_001015831
Protein GI: 124026716
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02733] C-3',4' desaturase CrtD
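
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
# Values copied from the record above.
START_BP = 1654438
END_BP = 1655955


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1518 505
```

Under these assumptions the results agree with the listed Gene Length (1518 bp) and Protein Length (505 aa).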


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.428487
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAG AATCTATCAT TGTTGTTGGG GGAGGGATAG CTGGCTTAAC AGGTGCTGCT 
CTTCTCTCTA AAGAGGGTTA TCAGGTAACT TTGGTCGAAG CACATAGTCA ATTAGGTGGT
TGTGCAGGAA CTTTTAAAAG AGGTTCTTAT ACCTTTGATG TTGGCGCTAC CCAAGTTGCA
GGTCTTGAGA GAGGAGGAAT TCATCATCGT TTATTTAATT ATTTAGATAT TCCTTTACCT
GATGCAAAAA TTTTGGATCC TGGCTGTTCA GTCACTCTCG GTGATGGGAG TAGGCCAATC
AATCTTTGGC ATGATCCATT GAGATGGCAG AAAGAAAGAC AAGAACAGTT TCCTGGGAGT
GAAATCTTCT GGTCATTATG TTCTAAAATT CATGAAAGTA ACTGGGAATT TGTCGAAAGA
GATCCAATAC TTCCGGTAAG AAATTTTTGG GATTTAAGTC AATTAATTAG AGCCATACGT
CCTTCAAATC TTTTTACTGG TTTTCTGAGT AAGTTAACTA TTACAGACTT GCTTAAAATA
ACTGGTTGTC ATAAAGATAG ACGTCTACGA AGTTTTTTAG ACCTTCAATT AAAACTTTAC
TCTCAAGAGC CAGCAAGTAG AACGGCAGCT TTATACGGTG CAACTGTTCT TCAAATGGCC
CAATCTCCTA GAGGTCTATG GCATCTTCAT GGATCAATGC AAATTCTTAG CGATTTGTTG
AAAGATAGTT TCTTGAGAGA TGGTGGAAGC CTTTTGATTG GGCATAGAGT GACCAAAATA
ATAAGGAAAG AAAATTCAAA TATTTTTGAT GTCAATGTGA TTGATAGAAG AAAGAATTTG
ATACGGATGA AAGCATCAGA TATTGTTTTT AGCTTGCCTC CTCAATCACT TTTAGATTTA
ATTCCTATTG ATGGAGGTTT ATCTACAACA TACCGTGAAA GTATAAAAAA TTTACCTAAA
CCCAGTGGTG CTATTGTATT TTACGGAGCT CTCCGTCGTG TAGATTTGCC CGTTGATTGT
CCAGGTCATA TTCAAGTATT TGATGAGCAT TTTGGTTCTT TATTTATTTC TATCAGTATG
GAAAATGATC AACGTGCACC AGTTGGGATG GCCACTTTAA TAGCAAGTGT ATTTGTAGAT
ATTGATCAAT GGTCTAATCT AGATAGTCAA TCATATATCA GAAAAAAAAA TGTTGTATCG
AAACAGATTA GAGCTATTTT AGATCACAAG TTTGATTTGC TAGAGACGAG TTGGGATCAT
CAAGAACTTT CTACCCCAAG AAGTTTTGAA AGGTGGACTG GACGTCCTAG TGGAATAGTC
GGAGGGCTTG GTCAACATCC AGATCAATTT GGTCCTTTTG GCCTGTCAAG TAGAACCCCT
CTCAGAGGTT TATGGCTTTG TGGAGATTCT ATATACCCTG GTGAAGGTAC CGCTGGCGTA
AGTCAATCAG CTTTGATGGT TGTAAGACAA TTATTGGAAT CTAAAGGTAG ACATCTAAAC
ATCCCTGTCT TTAATTAG
 
Protein sequence
MSEESIIVVG GGIAGLTGAA LLSKEGYQVT LVEAHSQLGG CAGTFKRGSY TFDVGATQVA 
GLERGGIHHR LFNYLDIPLP DAKILDPGCS VTLGDGSRPI NLWHDPLRWQ KERQEQFPGS
EIFWSLCSKI HESNWEFVER DPILPVRNFW DLSQLIRAIR PSNLFTGFLS KLTITDLLKI
TGCHKDRRLR SFLDLQLKLY SQEPASRTAA LYGATVLQMA QSPRGLWHLH GSMQILSDLL
KDSFLRDGGS LLIGHRVTKI IRKENSNIFD VNVIDRRKNL IRMKASDIVF SLPPQSLLDL
IPIDGGLSTT YRESIKNLPK PSGAIVFYGA LRRVDLPVDC PGHIQVFDEH FGSLFISISM
ENDQRAPVGM ATLIASVFVD IDQWSNLDSQ SYIRKKNVVS KQIRAILDHK FDLLETSWDH
QELSTPRSFE RWTGRPSGIV GGLGQHPDQF GPFGLSSRTP LRGLWLCGDS IYPGEGTAGV
SQSALMVVRQ LLESKGRHLN IPVFN