Gene Information |
Locus tag | P9303_18431 |
Symbol | |
ID | 4775998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1601654 |
End bp | 1606090 |
Gene Length | 4437 bp |
Protein Length | 1478 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640087352 |
Product | hypothetical protein |
Protein accession | YP_001017850 |
Protein GI | 124023543 |
COG category | [S] Function unknown |
COG ID | [COG3164] Predicted membrane protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAGA AGAGTCAATC TTTGGCTAAA GGGCGATGGA AACGCCCTGC TGTTCTACTG CTAACCAGCA GCTTGGTTGT TTGGGTTGGC GCCGATCGAG TTGTTGCTGC CCTTCTTGAA CGTCTGCGTC CGCAACTGGA GCAACAGCTT TCAAAACCTC TAGGCCATCC TTTAAAAATT GGTGCCTATC AGGGCCTGCG GCCCTGGGGT ATGGCCATTG GTCCCACTGA AGTGCTGTCT GGAAGTCATG ACGATTCCAC AGCATCGCTC TCTGGTCTAA CCATCAGCCT TGCCCCAATA GCCAGCCTGT TGCGTTTGCG ACCCGTTGCC GTACTAACCC TTGAAGGATC ACGGTTAACG CTGCGTCGTA ATCACAAGGG TTTCTACTGG GTCCCTGGTC CATCCAAAGG CGAGCCTCCC CCAAAGGTTG ATCTTCAGCT GCGGTTGACT CAGCCCGCCA GGGTCCGCAT CGAACCCGCA AACTTGGAGT TCACCGCTAC CACACGGGCA GACCTTCAGT TGGCGGAGGG ATGGGCCAAT GGGTCTGTCC AATTTGTATT GCCAGATCGT GGAAGATTCT TCCTAAAGGG CCGTGGGCGC TGGGATCGTT TAGAGCTTGA GGCCCATGCT CGTTTGGACC AAATCAGCCT TAAACCATTG CAGGGGCTTT TGCCGGGAAC GCTGCCCATG CAAATGCAGG GTCAGATTGG CGGTGACCTC CAAGTGAGTC TTAATCAGGG ACGTATGGGC TGTAAGGGAT CCCTGTCATT GGTTCGATTC CAATTAGCGG GTGGTCCCCT CAAAAAGTCA CTCTCATCTC GTGAAGCCAA GATCAACTGT CGTCAAGATC GTCTTCAGTT ACCTCTCAGC CAGTGGCGTT ACGGCCCTTG GACCGCTTCT CTCAAGGGAG GTGCTCGTCT CAATCACTCC TACAACCTCG ACCTCAAGGT CAACCAGCGG CAACAAGGGC ATGCCTTCCA AGCCCGGATC AATGGACCTT GGCATGAGCC CAATCTGCAC GCCAGTGGCC GCTGGATTCT TGCCCCAAAG ATCCCCGTTG ATGGTCCGCT GCAACTGAAC TTGCAGATGC GCACCAACTG GCGCAATCCC AAGGCCTTCA GGGCCGTTAT TGACACGTTT GATGTTCGTG GTCCAGGTCT TCAGGTCCGT GCAAGGGGGC CCCTCTATCC CGAGTTAGGG CTGAGCACCC AGCGCCTGGA GTTTGCAGGC CCTGCATGGC AGCGCATCCC TGTGCTTGCT GATCTTCTCG GCAGCCAATC CTTGATCAAA GGCAAACTTC AGCTTGAGGG CCCTTCATCG AGCCCCCAAC TTCAGCTAAG CCTTGCTCAG CAAGGTAATC CTTTGCTGGA AACCTGGTCT CTGCGAGCGG GGTGGTCAGC AGATTCCGGT TTGCTGCGCT TAAGGCAGTT CAACAGTCCC CTGCTCAAAG TTGTGGCGGA CTTGCCTCTC TCAGTTGATC AAGGACGCCT ACGCAGTGGA GAACTGCAGG CCAATCTGAA TTTGAGTCCT TTCCCTTTAG CTCGTATTGG TCCGCTTCTC GGCACCTCTT TGGCTGGAAC CCTTGCTGCT TCGGGTCAGG TGAGAGGGCC ATTCTCGGCC CTTCGGCCAA ATCTTTCCCT CCGGGTGGTG AATCCAGAGG CTGGAGGGTT GCGATTGTTG GAAGATTGGC AAGGGAACTT GGCCGGACTC CCTACTGGGG GTAGCACTCT GCTGATGGAA TCAGTGGGGG CGGTGATCCC TGGGCAGCTC TCAGCTCGAT TGGGACGCGA CTGGTTGCCA 
CAGGAGTTGG CGATTAACCG TGGCGATGGC CGTCTCTCGC TAAACGGCAT CCCAGCTCGC TATCTCTGGG AGTTGAACAA TTTCAAAGTG GATGGAATTG AGGCGGCCCT GTCTTCAAAG CAGCGATTTG AAGGCGTTTA TGGTCAACTC AGCGGGTCAG GCAGCCTGGG CCTTCAGCCC CTGGCCATGG AAGGCCAAGT CACCATCAGC AATCCAGGGT TGATGGGTCT GCAATTTCAA CAGGCTTTGC TTCAGGGGAG GGTCGCCAAC CAGCGCTACA AGATGACTGG TGAGCTGCTA CCAGCGGATA CAGGCCAGAT CAATCTTGCA GCAGGAGGTC GCCTTGGTGG CGAACTCTCT GCTAAGGCAA AGGCACGCGG TCTAAGCGCG CGTTGGCTGA TCTCTAGTGC AGAGCAACTT TCTAATTTTA ATGATGTTCT TCCAGCTTCG ATTGGGCGAG CCCAAGATCT GGGCACTCTG TTGATGCAAA CCTTGGGAAG GTCTCTAGAT GATCAGCTCA AAGTCTTGGC AGCGGCGCAG GCTTCCGTCA ATCGTTTCGA CCAGCAAAAT CGTCGTAGCA AGATCATCCA TCCAGAAGAT CTACGTGGGC AGATTGATGC GGTGATCGAT CTGAAGGGTC CCGATCTGTC CAAACTCAAT CTGGGGCTAA AGGCCAGCGG TCATCTCTGG ACAGAAAGTG AGGATCAGGA TCACGCCCTA CAAGTCAAAC CCATTTTCGC CTCGATTCAA GGACCACTGC ATGGTGGGGA AGGGTCGTTT TCACTGCTTC ACGTTCCCTT TTCTCTGCTT TCGTTGGTTG CACCTCTACC TCAAGCACTG CGAGGCGCTC TGGGACTTTC GGGTCGATAC AACCTCAGAC GGGGAACCCA TGAAATCACA GCTGACCTAG TGATGGAAAG CGCCAGGTTG GCTGAGAGCA AATTGAGCTT GGACAAAGGG CAGATCCTCT TGAATGATGC CCTCCTGAAC CTCGACTTCG CGCTGCGAAG CTCTTCTTCC AAAGAGGCTG TAACCATTAC CGGGCAAGTG CCTCTGGATC CCTCCTTGCC GATAGACGTC AGGGTTGAGA GCCACGGCGA CGGCCTACAT TTTCTGGCTG ATTTTGCTGA AGGTGCGGTC GCCTGGAAGG GCGGTAACTC CGACCTCAAG TTGTTGTTTA GCGGCAGCCT TAGCGCTCCT CAGGCGAATG GATTTTTGGT GGTGCAAAAC GGCGAATTCG TCGTCATGGA ACAAGTTGTC AAGGGGTTGG AGGCTGCCAT GGTTTTTGAC TTCAATCGCC TTGAGGTGCA GCGTCTCAAA GCCAAGATTG GTTCGAAGGG CATCTTGCAA GGTGCGGGAT CCATTGCCTT GCTTCGCCCT GCACCTGAAG ACCAGCCCCT GACCATCGAG ATCAGTAAGT CCCGTTTTAA GTTGCCAAAG GCTGATGTAG GGGTGGCCGC CAAGCTCAAG TTCACGGGTG CCTTACTGAA ACCCTTGATT GGTGGTGAGC TCACGATCAA AGAAGGAACG ATCTCACCCG CCGGCTCAGG GCTTCTGCGA CCGATCAACT CTGCCATTCA ATCCACCAAA AGGCCTGGAG CCGGAGAGGC GATAGCGACT TCCAGCAGCC CCAAAGTTGT TAATGCCAAC ACCCTGCTTG AGGAGCAGTG GGATTTCAAG AAACCCTTGG TTTTGCTTGG ACCAGATGTA GACGTCAGTC GAAGGAAGAT GTTGAGCTCC GTGATTCCCA ACATCCCCTC TATCAGCTTC GACAACCTGC GTTTAAAGCT GGGGCCAAAT TTGCGCATCA 
CTGCTAATGC GCTGGCCAAT TTCAGTACAG AAGGGTTGCT CAGTTTGAAT GGCCCCCTAG ATCCAAAGCT TCAGGCTCGT GGTGTGATTC GGCTACTGAA TGGGCGGCTG AATCTGTTTA CGACGTTCTT CAGTCTTGAT CAGCGAGCTG CGAATGTCGC TGTGTTTACT CCTTCTCTGG GCCTAATCCC TTACGTTGAT GTCGCCATGA ACAGCCAGGT CTCCGACAGC ATCAGCATCG GCACCGACAG TAATGCCGCA TCAGCGAATG TATTTGATAC CAATGGCACA GGTGCTCTTG GGGCCGGAGG ACAGTTCCGC TTGATCAAAG TAATGGTGAA GGCCGAGGGA CCTGCAAATC GTCTTTTCCA AAACATTGAC TTACGCAGCT CGCCGTCACT GCCTCGTGCT CAATTGCTGG GGTTAATTGG AGGAAATTCA CTGGCAGGGT TGTCGGGAGA AGGTGGTGGT GCGGCACTAG CGACTGTGAT CGGTCAATCT CTTCTCACGC CTGTTCTTGG AACAATCTCT GATGCTTTCA GTCAACGAAT GCAAATTGCC CTTTATCCTG CATACGTCTC GCCGGTTGTG ACGAGTCAAC AAGAGCGCGT TTCTGGGCAA GTCCCACCCA CTCTCGAGGT AGTCACAGAC ATTGGTATTG ATATCACTAA GCGACTTAAC GTTTCTATTT TGGCGACACC AGACCGCAAC GATATTCCTC CGCAAGGCAC CCTTACTTAT CAAATCAGTC CCAGCATGAA TCTCTCAGGT TCTGTAGACA GCCAGGGAAT TTGGCAGAGC CAATTGCAAT TATTTTTCCG CTTTTGA
|
Protein sequence | MGQKSQSLAK GRWKRPAVLL LTSSLVVWVG ADRVVAALLE RLRPQLEQQL SKPLGHPLKI GAYQGLRPWG MAIGPTEVLS GSHDDSTASL SGLTISLAPI ASLLRLRPVA VLTLEGSRLT LRRNHKGFYW VPGPSKGEPP PKVDLQLRLT QPARVRIEPA NLEFTATTRA DLQLAEGWAN GSVQFVLPDR GRFFLKGRGR WDRLELEAHA RLDQISLKPL QGLLPGTLPM QMQGQIGGDL QVSLNQGRMG CKGSLSLVRF QLAGGPLKKS LSSREAKINC RQDRLQLPLS QWRYGPWTAS LKGGARLNHS YNLDLKVNQR QQGHAFQARI NGPWHEPNLH ASGRWILAPK IPVDGPLQLN LQMRTNWRNP KAFRAVIDTF DVRGPGLQVR ARGPLYPELG LSTQRLEFAG PAWQRIPVLA DLLGSQSLIK GKLQLEGPSS SPQLQLSLAQ QGNPLLETWS LRAGWSADSG LLRLRQFNSP LLKVVADLPL SVDQGRLRSG ELQANLNLSP FPLARIGPLL GTSLAGTLAA SGQVRGPFSA LRPNLSLRVV NPEAGGLRLL EDWQGNLAGL PTGGSTLLME SVGAVIPGQL SARLGRDWLP QELAINRGDG RLSLNGIPAR YLWELNNFKV DGIEAALSSK QRFEGVYGQL SGSGSLGLQP LAMEGQVTIS NPGLMGLQFQ QALLQGRVAN QRYKMTGELL PADTGQINLA AGGRLGGELS AKAKARGLSA RWLISSAEQL SNFNDVLPAS IGRAQDLGTL LMQTLGRSLD DQLKVLAAAQ ASVNRFDQQN RRSKIIHPED LRGQIDAVID LKGPDLSKLN LGLKASGHLW TESEDQDHAL QVKPIFASIQ GPLHGGEGSF SLLHVPFSLL SLVAPLPQAL RGALGLSGRY NLRRGTHEIT ADLVMESARL AESKLSLDKG QILLNDALLN LDFALRSSSS KEAVTITGQV PLDPSLPIDV RVESHGDGLH FLADFAEGAV AWKGGNSDLK LLFSGSLSAP QANGFLVVQN GEFVVMEQVV KGLEAAMVFD FNRLEVQRLK AKIGSKGILQ GAGSIALLRP APEDQPLTIE ISKSRFKLPK ADVGVAAKLK FTGALLKPLI GGELTIKEGT ISPAGSGLLR PINSAIQSTK RPGAGEAIAT SSSPKVVNAN TLLEEQWDFK KPLVLLGPDV DVSRRKMLSS VIPNIPSISF DNLRLKLGPN LRITANALAN FSTEGLLSLN GPLDPKLQAR GVIRLLNGRL NLFTTFFSLD QRAANVAVFT PSLGLIPYVD VAMNSQVSDS ISIGTDSNAA SANVFDTNGT GALGAGGQFR LIKVMVKAEG PANRLFQNID LRSSPSLPRA QLLGLIGGNS LAGLSGEGGG AALATVIGQS LLTPVLGTIS DAFSQRMQIA LYPAYVSPVV TSQQERVSGQ VPPTLEVVTD IGIDITKRLN VSILATPDRN DIPPQGTLTY QISPSMNLSG SVDSQGIWQS QLQLFFRF