Gene Information |
Locus tag | P9301_12591 |
Symbol | |
ID | 4911424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 1066441 |
End bp | 1070085 |
Gene Length | 3645 bp |
Protein Length | 1214 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640160848 |
Product | hypothetical protein |
Protein accession | YP_001091483 |
Protein GI | 126696597 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain [TIGR02167] bacterial surface protein 26-residue repeat |
|
|
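The length fields in the record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1066441
END_BP = 1070085


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)
print(GENE_LEN, PROTEIN_LEN)  # prints: 3645 1214
```

Under these assumptions the results agree with the Gene Length (3645 bp) and Protein Length (1214 aa) fields.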
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTA AAAACAGAAG GAAGATATTA TCTTTCATTT ATAGGTATTC ACTTAGAAGC TTTTTTTTAT TGATTCTATT TAATGTCAAT AAATCACTAG CTAATAATTG CCAGAACATT GGTGAAATTG GAACTTCTGG CAGTTGTAAT GGAAAACTAA TTGTTAGCAG ACAAAATCTT CTCGACGCAA TCTCAGATGG AAGCTATGCC GTTATTGGTC CAGGCAATGT TAGTTACACA TTTGCTGAGG GTGGTACTGG TGATATTTAT ACAGGAAACA TAAATGATTT CTCTTCATTA TTTAAAGGTA AAAAAACTTT CAATCAAGAT ATTGGGTATT GGGATACAAG CAGTGCCACA AATATGTCTG AAATGTTTTC TAATGCCAGG AGATTCAATC AAGACATCAG TAATTGGGAT GTAAGTAATG TAACTAAAAT GAACAGAATG TTCTTAAACG CAAGATATTT TAATCAAGAC ATTAACGGTT GGGATGTAAG TAATGTTGAG CAAATAAATC TCATGTTTAG GAATGCGCAT AGGTTCAATC AATCTTTAAA TAGTTGGGAT GTAGGGAATG TGACACAAAT GGCTCATATC TTTAGGAGTG CAAAAGCATT TAATGGCAAT ATTTCTGCTT GGAACACCTC AAAAGTGAAA AATTTCATAG CAGTTTTTGA TGGTGCAAAA AGTTTTAATC AAGACATAAG TAATTGGGAT GTCAGCTCTG GAACAAGAAT GAATCATTTC CTAAGGAACA ATGCTGTTTT CAATAAGGAT CTATCTAGCT GGGATGTGCG CAAATTTCGC TCTGAGCCTA GTCATTTTGC TCCAAATTTA TTAACTGCGG GAGGAGTAAA ACCATGTTGG GGATTAAATG GATGCGCATC TGCAGATTTA ATTCCGGTTC TAAGTTCCTA TTCACCAAAT AATTTTGATG TTTCTCATGG AAGCAATCTT GATTTGGAGT TGAATTTTAA TATGGCAGTT GAACTGGTAA GTAAAAAAAG TAATGTTATC CTTCACAAAA TGAGTGGAAG CAACCTAAAA AAAGTTGCAA CATATAATTT ACTCAAATCA GACAAAGTTA GTTTCTCAGA TGACAAAACA AAGATTACAA TCAATATTGG ACCTAAAATT ACAGACAATA CAAAATATGT TGTCCAGATA AGACCTGGAA GTATCAAATC CTCCTCTTCA GGGGCATATT TCCAAGGTAT CCAACCTGAA TATCAAAAAT CAGGAAGTAT ATGGTTTTCG ACAGGCAGTA ATGATCAAGT ACTTGACATC ATAGGAACCA CTCCATCAAG TGGATCAAAT TCATTAGAGA CTGAAAATCC TCAGATAATA ATCAGATTCA GTGAAGATAT TGCTCTTGGT ACAGGGAATG TAACCCTAAA AAAATATTCT GATGATTCTA CTGTCAGAGC TTTTAACGTT GCCAATTCAA CAGATCAGGA AGATTTACAA ATAAACGACA CTGATTTGAC CATAAAATTA GTAGATACTA ATGGTGATAC ATTAGTTGTT GGATCCACCA AATACTATTT GCAAGTAGAT GCAACTGCAA TTGATAATGC AGGCTCATCT AAATCATTCG CAGGAATTAG CAATAAAGAT GCATATTCAT ACACAACTAT TTCTGCCAGT AATTGCGGTG CAATAACTGG TCAGGCAAAA TATTGGAAAG GTAAAGGAGC AGCAAGTAGC AGTGTAAAGA TATATAGAGA TAATTCTCTT GTAGATACAA AAACTACAGA TGATCTTGGT TTTTATTATT TTTATCCCAC TCAAACAGGA 
ACTTATCATG TTGAGTTTGT AAAGCCTACA AGTAATTCTA ATGCAGATAA ATTAACAAGA GCAGCTTTAT CAATACCACA AGGACAAGTA AATTCAGATG ATATTGTTCC AGTCAACTCT GGGAGATGGG TAAGAAATAT AGAAATCACT ACAGCATGCG AGTTTCATAC AGAAATAGAT GGCCTTTTAA TTGATCCTGC AGGTGTTATC TATGATGCGA CCACGCGCCA ACCAGTCTCA GGAGCTACTG TCAGACTTTT ATATAATGGA GAATTAGTTA ATAACGACTG GCTGGACGAC AGTGGTGGTA AAAATTCACA AATAACGAGT TCAGATGGAC AATACAGTTT CACTCTTAAG GCAGACTCAG CTGCAGACGG TACATACACG ATAGAAGTTC TACCTCCAAC TGCTTATAAA TTTCAAAGTT CTCAAATTCC AGTAGAGGGT GATACATATT CACCCCAATT AGGAGGATCA GTAGAAGAGA TTCAAGATCA GGAAGAAGCT CCTGCATCAG ATCAAGATAC AACTTATTAT TTATCATTTT CATTCGTATT TACTAATGAA GCTGCCACCA CGTCAAATGG AGTAATTAAT AATCATATTC CTATTGATCC AGCTGTAGAT CCCACAACCA AAGCAGATGT TAATGGTTTG GTAGAGGCAT GGACTAATGC GGCTATTCGT TTTAACAAAT CAAGTGTCAA AGCTGTTGAT AAACGATTTG ATTGGTTAAG AAGTAATCAA AATTCGGAAA AGAAATCTCA TCAAGGTATA AATATTTCAT TTGATAATCA ATTATTAGAA AAGGCTCTAA ATGGTTCTTC AAAAAGATTC AAAGACTTAA ATTATAGAGA TATAGAAAGT TGGGCTAGAT CTAATTGGTC TAATGAGAGA CTAAAAAAAG AATCAGATCA GGTTTTTAAT GATCTTATTG ATAACTCTGT AGATCTTGCT TTTGCCGAAT TACGAGAAAA AACATTTAAG CCGAATCTGA ATCCAACCGG AGGTGAATTA ATTGGTAACT GGTCAGTTTG GACTAATGGT AAGATTCTTT TTGGAAATAA AGGTATTAGT TCAAAATCAT CAGAACAAGA TATCAATAGC TTATTTTTAA CCTTGGGGAT TGATAAACCC TATAAAGAGA ATGGTTTATT TGGAGTTGCT TTCAATTATG GAAAAGATGA TATAAGTGTA GGCAATGCAG GGAGTGGTAT TGATTCTACA AACTTAGGCT TTAATTTTTA TTCTTCAAAT CTTCTAAAAG ATAAATTTCC TATAGAATCT CAAATCGGTT TTGGAAAGAT GGATATGAAT ACGAAAAGAA TTGATAATTC TACTTCTCAC ATAGGAGATA GGGATGTATA CATGATTTTC GGCTCTGCGA AGATTTTGGC AGAGCCTTTT AAAATCAAAA ATTTTCAATT AACTCCTTAT GGAAGATTAG ATTTGGCTCA TATTAATTTA AAGGCCTTTT CTGAATCTGG AAGTAGTCTC GCACTCTCAT TTAAAGATCA AACTGTTAAT AGAAAAATGG TCTCTTTAGG AGTAAATGTA GATAGGGATT TTATATTTGA AAATTGGAGA TTAAAACCAT TTTTAGGAAT ATCTTATGGT TATGATTTCA CCGGAGATTC GATTGTAGAT ATGAATTATG TTGGTGACTC TCAAAATTAC AGAATTATTC TTGATGAATT TAGTTCAAAT AATTGGAACA CAAATATTGG TTTTGAGTTT TTTAGGGATA ATGATTGGTC AGGAAGTATT AGTTATGAGT ATGAAAAAGC AGGTTCTTCT TCTCATATAA 
ATTCCTATCA ATTTAATATT TCATGGTTCT TTTAA
|
Protein sequence | MNFKNRRKIL SFIYRYSLRS FFLLILFNVN KSLANNCQNI GEIGTSGSCN GKLIVSRQNL LDAISDGSYA VIGPGNVSYT FAEGGTGDIY TGNINDFSSL FKGKKTFNQD IGYWDTSSAT NMSEMFSNAR RFNQDISNWD VSNVTKMNRM FLNARYFNQD INGWDVSNVE QINLMFRNAH RFNQSLNSWD VGNVTQMAHI FRSAKAFNGN ISAWNTSKVK NFIAVFDGAK SFNQDISNWD VSSGTRMNHF LRNNAVFNKD LSSWDVRKFR SEPSHFAPNL LTAGGVKPCW GLNGCASADL IPVLSSYSPN NFDVSHGSNL DLELNFNMAV ELVSKKSNVI LHKMSGSNLK KVATYNLLKS DKVSFSDDKT KITINIGPKI TDNTKYVVQI RPGSIKSSSS GAYFQGIQPE YQKSGSIWFS TGSNDQVLDI IGTTPSSGSN SLETENPQII IRFSEDIALG TGNVTLKKYS DDSTVRAFNV ANSTDQEDLQ INDTDLTIKL VDTNGDTLVV GSTKYYLQVD ATAIDNAGSS KSFAGISNKD AYSYTTISAS NCGAITGQAK YWKGKGAASS SVKIYRDNSL VDTKTTDDLG FYYFYPTQTG TYHVEFVKPT SNSNADKLTR AALSIPQGQV NSDDIVPVNS GRWVRNIEIT TACEFHTEID GLLIDPAGVI YDATTRQPVS GATVRLLYNG ELVNNDWLDD SGGKNSQITS SDGQYSFTLK ADSAADGTYT IEVLPPTAYK FQSSQIPVEG DTYSPQLGGS VEEIQDQEEA PASDQDTTYY LSFSFVFTNE AATTSNGVIN NHIPIDPAVD PTTKADVNGL VEAWTNAAIR FNKSSVKAVD KRFDWLRSNQ NSEKKSHQGI NISFDNQLLE KALNGSSKRF KDLNYRDIES WARSNWSNER LKKESDQVFN DLIDNSVDLA FAELREKTFK PNLNPTGGEL IGNWSVWTNG KILFGNKGIS SKSSEQDINS LFLTLGIDKP YKENGLFGVA FNYGKDDISV GNAGSGIDST NLGFNFYSSN LLKDKFPIES QIGFGKMDMN TKRIDNSTSH IGDRDVYMIF GSAKILAEPF KIKNFQLTPY GRLDLAHINL KAFSESGSSL ALSFKDQTVN RKMVSLGVNV DRDFIFENWR LKPFLGISYG YDFTGDSIVD MNYVGDSQNY RIILDEFSSN NWNTNIGFEF FRDNDWSGSI SYEYEKAGSS SHINSYQFNI SWFF
|
| |
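The gene and protein sequences above are printed in space-separated blocks of 10 characters. As a rough consistency check, the sketch below strips that whitespace and translates short excerpts of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons). The helper name `translate` and the choice of excerpts are illustrative, not part of the record.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First three blocks of the gene sequence as printed in the record.
print(translate("ATGAATTTTA AAAACAGAAG GAAGATATTA"))  # prints: MNFKNRRKIL

# Last 15 bases of the gene sequence, ending in the stop codon.
print(translate("TCATGGTTCT TTTAA"))  # prints: SWFF*
```

Both excerpts match the corresponding ends of the listed protein sequence, which begins with MNFKNRRKIL and ends with SWFF.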