Gene NATL1_05991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_05991 
SymbolchlB 
ID4779761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp545118 
End bp546695 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content39% 
IMG OID640083876 
Productlight-independent protochlorophyllide reductase subunit B 
Protein accessionYP_001014426 
Protein GI124025310 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01278] light-independent protochlorophyllide reductase, B subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.694309 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTAA CTCTCTGGAC ATACGAAGGA CCTCCTCATA TTGGTGCGAT GAGAATCGCT 
ACGTCTATGA AAAAATTGCA TTATGTATTG CATGCACCTC AAGGAGATAC CTACGCTGAT
CTTCTATTTA CGATGATTGA ACGCCGTGGG AGTAGACCAC CTGTAACTTA TACAACTTTT
CAAGCTAGAG ATTTAGGAGG TGATACCGCT GAACTTGTAA AAGGACATAT AAAAGAAGCC
GTAGATAGAT TTAAACCAGA AGCTCTTTTG GTTGGCGAAA GTTGTACAGC CGAATTAATT
CAAGATCAGC CTGGTTCACT CGCAAAAGGT ATGGGATTTG ATATCCCAAT TGTCAGCCTC
GAATTACCTG CATATAGCAA AAAGGAAAAC TGGGGTGGCT CTGAGACTTT TTATCAAATC
GTAAGAAGTT TACTTAAAGA TCACTCGAGA GAATCGAAAC AATCGTGGCA GGAAGAAAAA
AGAAGACCAA GAGTAAATCT TCTTGGCCCA ACTTTGCTTG GTTTTAGATG TCGTGATGAT
GTTTTGGAAA TACAAAAACT ACTTGGTCAG TACGGGATAG ACGTCAATGT TGTGGCGCCA
TTAGGAGCAT CACCTGCAGA TATATTGCGA ATCCCCAATG CTGATGTAAA TGTTTGCCTT
TATCCAGAAA TAGCGGAATC AACTTGTATT TGGCTTGAAA GAAATTTAAA TATTCCTTTT
ACAACAACAG TTCCGCTTGG TGTTGGGGCT ACTCAGGATT TTCTTAAAGA ATTACACAAA
GTGTTAGAGA TGGAAATCCC TCAATCAGTA AACGAATCTA ATAATTCGAA ATTAACTTGG
TACTCGAATT CAGTGGATTC GAATTACCTA ACAGGTAAAA GAGTCTTTAT TTTTGGAGAC
GGAACACACG CTCTCGCTGC AGCAAGAATT GCTAATGAGG AGCTTGGTTT TAAAGTTGTA
GGTCTAGGAA CTTATAGTCG AGAAATGGCT AGGAAAGTTC GTCCAGCCGC TAAGGCACTT
GGTTTAGAGG CATTGATAAC CAATGACTAC TTAGAAGTAG AAGATGCGAT AAAAGAAACA
TCTCCAGAAC TAGTCCTTGG CACACAAATG GAGCGACATA GTGCAAAAAG ACTTGGTATA
CCATGCGCTG TTATCAGTAC ACCAATGCAT GTGCAGGATG TACCTGCTCG ATATAGCCCT
CAAATGGGAT GGGAGGGGGC AAATGTCATT TTTGATGACT GGGTTCATCC ATTAATGATG
GGACTTGAGG AACATTTAAT TGGAATGTTT AAGCATGACT TCGAATTTGT TGATGGCCAC
CAAAGTCATC TAGGACATTT AGGTGGAAAA GGAACTCAAA ATACAACTAA AGAGGCTATA
AAAACAAACT TACAAGATTC AGTAATTACA GATGGCGATC CTATATGGAC ACATGAAGGT
GAAAAAGAAC TTTCGAAAAT CCCATTTTTT GTAAGAGGTA AGGTAAGAAG AAATACAGAG
AATTATGCTC GCCAAGCTGG ATGTAGAGAA ATCAACGAAG AAACTCTATA TGACGCTAAG
GCTCATTATA AAGCCTAA
 
Protein sequence
MELTLWTYEG PPHIGAMRIA TSMKKLHYVL HAPQGDTYAD LLFTMIERRG SRPPVTYTTF 
QARDLGGDTA ELVKGHIKEA VDRFKPEALL VGESCTAELI QDQPGSLAKG MGFDIPIVSL
ELPAYSKKEN WGGSETFYQI VRSLLKDHSR ESKQSWQEEK RRPRVNLLGP TLLGFRCRDD
VLEIQKLLGQ YGIDVNVVAP LGASPADILR IPNADVNVCL YPEIAESTCI WLERNLNIPF
TTTVPLGVGA TQDFLKELHK VLEMEIPQSV NESNNSKLTW YSNSVDSNYL TGKRVFIFGD
GTHALAAARI ANEELGFKVV GLGTYSREMA RKVRPAAKAL GLEALITNDY LEVEDAIKET
SPELVLGTQM ERHSAKRLGI PCAVISTPMH VQDVPARYSP QMGWEGANVI FDDWVHPLMM
GLEEHLIGMF KHDFEFVDGH QSHLGHLGGK GTQNTTKEAI KTNLQDSVIT DGDPIWTHEG
EKELSKIPFF VRGKVRRNTE NYARQAGCRE INEETLYDAK AHYKA