Gene P9211_05471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_05471 
SymbolchlN 
ID5731252 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp513073 
End bp514329 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content41% 
IMG OID641284906 
Productlight-independent protochlorophyllide reductase subunit N 
Protein accessionYP_001550432 
Protein GI159903088 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01279] light-independent protochlorophyllide reductase, N subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCG CATCGCTGCT AAAGGAAACA GGGCCTAGAG AAGTCTTCTG TGGACTTACA 
TCCATTGTTT GGCTTCATCG AAGAATGCCC GATGCCTTTT TTCTTGTTGT TGGATCAAGA
ACATGTGCGC ACTTAATTCA AAGTGCTGCT GGCGTCATGA TCTTTGCAGA GCCACGTTTT
GGAACTGCCA TCCTCGAAGA GAGAGATCTG GCTGGATTAG CAGATGCTCA TGAGGAACTT
GACCGAGTAG TAAAAAATCT TCTAACAAGG CGTCCAGAAA TTCGAACTTT ATTTCTTGTT
GGCTCGTGCC CTAGTGAAGT GATCAAAATT GATCTAGCAA GGGCTGCTGA AAGACTTAAT
TCTCAATTCA ATGGAAAAGT AACCATCCTC AATTATTCAG GAAGTGGAAT TGAGACAACT
TTCACTCAAG GAGAAGATGG TGCTCTTAAA GCTTTTGTCC CATTAATGCC ATCTAGCGAT
AAAAAACAAT TGCTATTAGT TGGCACATTG GCAAATGCAG TTGAAGATCG TTTAATCACA
ATATTCAAAA GACTAGGCAT AGAGAACGTT GATAGTTTCC CGCCTAGACA ATCCACGGAG
TTACCTTCGA TTGGGCCAGA AACGAAAGTT CTACTAACTC AGCCATATTT AACAGATACT
GCAAGGGTCC TGAAAGACAG AGGTGCTGAA ATACTTCCAG CACCTTTCCC ACTAGGAGTT
GAAGGCAGCA GACTTTGGAT AGAAGCAGCC GCTAAATCTT TTAATGTTGA CCAATCATTA
GTTACTTCAA CATTAGAACC TTTAATTTTA CGTGCTCGAA AAGCCCTTAA GCCCTATATA
GAAAAACTGA CTGGAAAAAA ACTCTTTCTT TTACCTGAAT CACAATTAGA GATACCACTT
GCACGTTTTC TACATATGGA ATGTGGAATG GAACTTCTAG AGATTGGGAC CCCTTATTTA
AATAGGGACA TGATGAAACC TGAGCTAGAT CTTCTCCCTG ATAAAACTCG AATTGTTGAA
GGACAACACG TAGAAAAGCA GCTTGATCGT GTTCGTAAAA ACCAACCAGA CCTTGTTGTA
TGTGGGATGG GGCTTGCTAA TCCACTCGAA GCAGAAGGCT TTTCCACTAA ATGGTCAATT
GAAATGGTAT TCAGCCCAAT CCATGGAATA GATCAAGCAT CAGACCTTGC AGAACTTTTT
TCAAGGCCCC TTCACCGCCA CGATCTTTTA AATACCAAAC AACTCACAAG CACTTAA
 
Protein sequence
MSGASLLKET GPREVFCGLT SIVWLHRRMP DAFFLVVGSR TCAHLIQSAA GVMIFAEPRF 
GTAILEERDL AGLADAHEEL DRVVKNLLTR RPEIRTLFLV GSCPSEVIKI DLARAAERLN
SQFNGKVTIL NYSGSGIETT FTQGEDGALK AFVPLMPSSD KKQLLLVGTL ANAVEDRLIT
IFKRLGIENV DSFPPRQSTE LPSIGPETKV LLTQPYLTDT ARVLKDRGAE ILPAPFPLGV
EGSRLWIEAA AKSFNVDQSL VTSTLEPLIL RARKALKPYI EKLTGKKLFL LPESQLEIPL
ARFLHMECGM ELLEIGTPYL NRDMMKPELD LLPDKTRIVE GQHVEKQLDR VRKNQPDLVV
CGMGLANPLE AEGFSTKWSI EMVFSPIHGI DQASDLAELF SRPLHRHDLL NTKQLTST