Gene P9301_12591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_12591 
Symbol 
ID4911424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1066441 
End bp1070085 
Gene Length3645 bp 
Protein Length1214 aa 
Translation table11 
GC content34% 
IMG OID640160848 
Producthypothetical protein 
Protein accessionYP_001091483 
Protein GI126696597 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain
[TIGR02167] bacterial surface protein 26-residue repeat 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTA AAAACAGAAG GAAGATATTA TCTTTCATTT ATAGGTATTC ACTTAGAAGC 
TTTTTTTTAT TGATTCTATT TAATGTCAAT AAATCACTAG CTAATAATTG CCAGAACATT
GGTGAAATTG GAACTTCTGG CAGTTGTAAT GGAAAACTAA TTGTTAGCAG ACAAAATCTT
CTCGACGCAA TCTCAGATGG AAGCTATGCC GTTATTGGTC CAGGCAATGT TAGTTACACA
TTTGCTGAGG GTGGTACTGG TGATATTTAT ACAGGAAACA TAAATGATTT CTCTTCATTA
TTTAAAGGTA AAAAAACTTT CAATCAAGAT ATTGGGTATT GGGATACAAG CAGTGCCACA
AATATGTCTG AAATGTTTTC TAATGCCAGG AGATTCAATC AAGACATCAG TAATTGGGAT
GTAAGTAATG TAACTAAAAT GAACAGAATG TTCTTAAACG CAAGATATTT TAATCAAGAC
ATTAACGGTT GGGATGTAAG TAATGTTGAG CAAATAAATC TCATGTTTAG GAATGCGCAT
AGGTTCAATC AATCTTTAAA TAGTTGGGAT GTAGGGAATG TGACACAAAT GGCTCATATC
TTTAGGAGTG CAAAAGCATT TAATGGCAAT ATTTCTGCTT GGAACACCTC AAAAGTGAAA
AATTTCATAG CAGTTTTTGA TGGTGCAAAA AGTTTTAATC AAGACATAAG TAATTGGGAT
GTCAGCTCTG GAACAAGAAT GAATCATTTC CTAAGGAACA ATGCTGTTTT CAATAAGGAT
CTATCTAGCT GGGATGTGCG CAAATTTCGC TCTGAGCCTA GTCATTTTGC TCCAAATTTA
TTAACTGCGG GAGGAGTAAA ACCATGTTGG GGATTAAATG GATGCGCATC TGCAGATTTA
ATTCCGGTTC TAAGTTCCTA TTCACCAAAT AATTTTGATG TTTCTCATGG AAGCAATCTT
GATTTGGAGT TGAATTTTAA TATGGCAGTT GAACTGGTAA GTAAAAAAAG TAATGTTATC
CTTCACAAAA TGAGTGGAAG CAACCTAAAA AAAGTTGCAA CATATAATTT ACTCAAATCA
GACAAAGTTA GTTTCTCAGA TGACAAAACA AAGATTACAA TCAATATTGG ACCTAAAATT
ACAGACAATA CAAAATATGT TGTCCAGATA AGACCTGGAA GTATCAAATC CTCCTCTTCA
GGGGCATATT TCCAAGGTAT CCAACCTGAA TATCAAAAAT CAGGAAGTAT ATGGTTTTCG
ACAGGCAGTA ATGATCAAGT ACTTGACATC ATAGGAACCA CTCCATCAAG TGGATCAAAT
TCATTAGAGA CTGAAAATCC TCAGATAATA ATCAGATTCA GTGAAGATAT TGCTCTTGGT
ACAGGGAATG TAACCCTAAA AAAATATTCT GATGATTCTA CTGTCAGAGC TTTTAACGTT
GCCAATTCAA CAGATCAGGA AGATTTACAA ATAAACGACA CTGATTTGAC CATAAAATTA
GTAGATACTA ATGGTGATAC ATTAGTTGTT GGATCCACCA AATACTATTT GCAAGTAGAT
GCAACTGCAA TTGATAATGC AGGCTCATCT AAATCATTCG CAGGAATTAG CAATAAAGAT
GCATATTCAT ACACAACTAT TTCTGCCAGT AATTGCGGTG CAATAACTGG TCAGGCAAAA
TATTGGAAAG GTAAAGGAGC AGCAAGTAGC AGTGTAAAGA TATATAGAGA TAATTCTCTT
GTAGATACAA AAACTACAGA TGATCTTGGT TTTTATTATT TTTATCCCAC TCAAACAGGA
ACTTATCATG TTGAGTTTGT AAAGCCTACA AGTAATTCTA ATGCAGATAA ATTAACAAGA
GCAGCTTTAT CAATACCACA AGGACAAGTA AATTCAGATG ATATTGTTCC AGTCAACTCT
GGGAGATGGG TAAGAAATAT AGAAATCACT ACAGCATGCG AGTTTCATAC AGAAATAGAT
GGCCTTTTAA TTGATCCTGC AGGTGTTATC TATGATGCGA CCACGCGCCA ACCAGTCTCA
GGAGCTACTG TCAGACTTTT ATATAATGGA GAATTAGTTA ATAACGACTG GCTGGACGAC
AGTGGTGGTA AAAATTCACA AATAACGAGT TCAGATGGAC AATACAGTTT CACTCTTAAG
GCAGACTCAG CTGCAGACGG TACATACACG ATAGAAGTTC TACCTCCAAC TGCTTATAAA
TTTCAAAGTT CTCAAATTCC AGTAGAGGGT GATACATATT CACCCCAATT AGGAGGATCA
GTAGAAGAGA TTCAAGATCA GGAAGAAGCT CCTGCATCAG ATCAAGATAC AACTTATTAT
TTATCATTTT CATTCGTATT TACTAATGAA GCTGCCACCA CGTCAAATGG AGTAATTAAT
AATCATATTC CTATTGATCC AGCTGTAGAT CCCACAACCA AAGCAGATGT TAATGGTTTG
GTAGAGGCAT GGACTAATGC GGCTATTCGT TTTAACAAAT CAAGTGTCAA AGCTGTTGAT
AAACGATTTG ATTGGTTAAG AAGTAATCAA AATTCGGAAA AGAAATCTCA TCAAGGTATA
AATATTTCAT TTGATAATCA ATTATTAGAA AAGGCTCTAA ATGGTTCTTC AAAAAGATTC
AAAGACTTAA ATTATAGAGA TATAGAAAGT TGGGCTAGAT CTAATTGGTC TAATGAGAGA
CTAAAAAAAG AATCAGATCA GGTTTTTAAT GATCTTATTG ATAACTCTGT AGATCTTGCT
TTTGCCGAAT TACGAGAAAA AACATTTAAG CCGAATCTGA ATCCAACCGG AGGTGAATTA
ATTGGTAACT GGTCAGTTTG GACTAATGGT AAGATTCTTT TTGGAAATAA AGGTATTAGT
TCAAAATCAT CAGAACAAGA TATCAATAGC TTATTTTTAA CCTTGGGGAT TGATAAACCC
TATAAAGAGA ATGGTTTATT TGGAGTTGCT TTCAATTATG GAAAAGATGA TATAAGTGTA
GGCAATGCAG GGAGTGGTAT TGATTCTACA AACTTAGGCT TTAATTTTTA TTCTTCAAAT
CTTCTAAAAG ATAAATTTCC TATAGAATCT CAAATCGGTT TTGGAAAGAT GGATATGAAT
ACGAAAAGAA TTGATAATTC TACTTCTCAC ATAGGAGATA GGGATGTATA CATGATTTTC
GGCTCTGCGA AGATTTTGGC AGAGCCTTTT AAAATCAAAA ATTTTCAATT AACTCCTTAT
GGAAGATTAG ATTTGGCTCA TATTAATTTA AAGGCCTTTT CTGAATCTGG AAGTAGTCTC
GCACTCTCAT TTAAAGATCA AACTGTTAAT AGAAAAATGG TCTCTTTAGG AGTAAATGTA
GATAGGGATT TTATATTTGA AAATTGGAGA TTAAAACCAT TTTTAGGAAT ATCTTATGGT
TATGATTTCA CCGGAGATTC GATTGTAGAT ATGAATTATG TTGGTGACTC TCAAAATTAC
AGAATTATTC TTGATGAATT TAGTTCAAAT AATTGGAACA CAAATATTGG TTTTGAGTTT
TTTAGGGATA ATGATTGGTC AGGAAGTATT AGTTATGAGT ATGAAAAAGC AGGTTCTTCT
TCTCATATAA ATTCCTATCA ATTTAATATT TCATGGTTCT TTTAA
 
Protein sequence
MNFKNRRKIL SFIYRYSLRS FFLLILFNVN KSLANNCQNI GEIGTSGSCN GKLIVSRQNL 
LDAISDGSYA VIGPGNVSYT FAEGGTGDIY TGNINDFSSL FKGKKTFNQD IGYWDTSSAT
NMSEMFSNAR RFNQDISNWD VSNVTKMNRM FLNARYFNQD INGWDVSNVE QINLMFRNAH
RFNQSLNSWD VGNVTQMAHI FRSAKAFNGN ISAWNTSKVK NFIAVFDGAK SFNQDISNWD
VSSGTRMNHF LRNNAVFNKD LSSWDVRKFR SEPSHFAPNL LTAGGVKPCW GLNGCASADL
IPVLSSYSPN NFDVSHGSNL DLELNFNMAV ELVSKKSNVI LHKMSGSNLK KVATYNLLKS
DKVSFSDDKT KITINIGPKI TDNTKYVVQI RPGSIKSSSS GAYFQGIQPE YQKSGSIWFS
TGSNDQVLDI IGTTPSSGSN SLETENPQII IRFSEDIALG TGNVTLKKYS DDSTVRAFNV
ANSTDQEDLQ INDTDLTIKL VDTNGDTLVV GSTKYYLQVD ATAIDNAGSS KSFAGISNKD
AYSYTTISAS NCGAITGQAK YWKGKGAASS SVKIYRDNSL VDTKTTDDLG FYYFYPTQTG
TYHVEFVKPT SNSNADKLTR AALSIPQGQV NSDDIVPVNS GRWVRNIEIT TACEFHTEID
GLLIDPAGVI YDATTRQPVS GATVRLLYNG ELVNNDWLDD SGGKNSQITS SDGQYSFTLK
ADSAADGTYT IEVLPPTAYK FQSSQIPVEG DTYSPQLGGS VEEIQDQEEA PASDQDTTYY
LSFSFVFTNE AATTSNGVIN NHIPIDPAVD PTTKADVNGL VEAWTNAAIR FNKSSVKAVD
KRFDWLRSNQ NSEKKSHQGI NISFDNQLLE KALNGSSKRF KDLNYRDIES WARSNWSNER
LKKESDQVFN DLIDNSVDLA FAELREKTFK PNLNPTGGEL IGNWSVWTNG KILFGNKGIS
SKSSEQDINS LFLTLGIDKP YKENGLFGVA FNYGKDDISV GNAGSGIDST NLGFNFYSSN
LLKDKFPIES QIGFGKMDMN TKRIDNSTSH IGDRDVYMIF GSAKILAEPF KIKNFQLTPY
GRLDLAHINL KAFSESGSSL ALSFKDQTVN RKMVSLGVNV DRDFIFENWR LKPFLGISYG
YDFTGDSIVD MNYVGDSQNY RIILDEFSSN NWNTNIGFEF FRDNDWSGSI SYEYEKAGSS
SHINSYQFNI SWFF