Gene NATL1_05981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_05981 
SymbolchlN 
ID4779847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp543854 
End bp545113 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content39% 
IMG OID640083875 
Productlight-independent protochlorophyllide reductase subunit N 
Protein accessionYP_001014425 
Protein GI124025309 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01279] light-independent protochlorophyllide reductase, N subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.926498 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGTG CAACGCTCCT TAAAGAATCT GGTCCAAAAG AAGTCTTTTG CGGGCTAACT 
TCTATCGTTT GGCTGCATAG AAGAATGCCT GATGCTTTCT TCCTTGTTGT GGGTTCTAGA
ACCTGTGCGC ATTTAATTCA AAGTGCAGCA GGCGTAATGA TCTTTGCTGA ACCACGCTTT
GGAACAGCTA TTTTAGAAGA GAGAGATTTA GCTGGATTAG CTGATGCTCA TGACGAGTTA
AACCGAGTAG TAAAAAATCT TTTAGCCAGA CGTCCCGAAA TAAAAACTCT TTTTCTTGTT
GGCTCTTGCC CAAGTGAAGT AATAAAAATA GATCTTTCAA GGGTTGCTGA AAATCTGAAT
ATCGAACTTA AAGGTCAAGT AACAGTATTG AATTATTCGG GAAGTGGAAT AGAAACAACT
TTCACTCAAG GCGAAGACGG AGCTTTAAAG GCTTTGATTC CATTGATGCC GAAGAGCGAT
CAAAAGAAAT TACTTTTAGT TGGAACTCTT GCAAATGCTG TGGAGGACCG TTTAGTAAGT
ATTTTTAATC GGCTTGGAAT AGATAATGTT GAAAGTTTTC CACCTAGGCA GTCAACAGAA
TTACCTTCTA TTGGTCCAGA GACCAAAGTA CTTCTTACTC AACCCTACTT AACTGATACG
GCAAGAGAGC TTAAAAATAA AGGTGCTGAG ATAATAGAAG CGCCCTTTCC TCTAGGTGTT
ACGGGTAGCA CATTGTGGAT TCAGGCTGCT GCAAATTCAT TTGGCATCGA TAAATCTATT
GTTGATTCGA TATTGAATCC ACTGATATCG AGGGCAAAGA AAGCTTTGGA GCCTCATGTT
GAGAAACTTT CTGGTAAGAA ATTGTTCCTT TTGCCAGAAT CTCAATTAGA AATACCTCTC
GCAAGATTTC TAAGTAATGA GTGTGGAATG GAGATTGTTG AAATAGGAAC GCCTTATTTA
AATAGAGATT TAATGAAAGC AGAAATAGAC TTGCTACCTC CTGATTGTCG TATTGTCGAA
GGACAACATG TAGAGAAACA ATTAGACAGA GTAAGAGATA GTTCGCCAGA TCTTGTTGTT
TGTGGAATGG GACTTGCCAA TCCACTTGAA GCAGAGGGGA TATCAACCAA ATGGTCAATT
GAAATGGTTT TCAGCCCAAT TCACGGGATT GATCAAGCTT CAGATTTAGC AGAATTGTTT
TCAAGGCCAC TTCGCAGGCA TGACATTTTA AATCCTACTA AAACTCTTAC ATCAAACTAA
 
Protein sequence
MSGATLLKES GPKEVFCGLT SIVWLHRRMP DAFFLVVGSR TCAHLIQSAA GVMIFAEPRF 
GTAILEERDL AGLADAHDEL NRVVKNLLAR RPEIKTLFLV GSCPSEVIKI DLSRVAENLN
IELKGQVTVL NYSGSGIETT FTQGEDGALK ALIPLMPKSD QKKLLLVGTL ANAVEDRLVS
IFNRLGIDNV ESFPPRQSTE LPSIGPETKV LLTQPYLTDT ARELKNKGAE IIEAPFPLGV
TGSTLWIQAA ANSFGIDKSI VDSILNPLIS RAKKALEPHV EKLSGKKLFL LPESQLEIPL
ARFLSNECGM EIVEIGTPYL NRDLMKAEID LLPPDCRIVE GQHVEKQLDR VRDSSPDLVV
CGMGLANPLE AEGISTKWSI EMVFSPIHGI DQASDLAELF SRPLRRHDIL NPTKTLTSN