Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_05491 |
Symbol | |
ID | 4778137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 531633 |
End bp | 533921 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640086054 |
Product | outer envelope membrane protein |
Protein accession | YP_001016566 |
Protein GI | 124022259 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR03304] outer membrane insertion C-terminal signal |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.445832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCAGCT TCCTATCTAG CGGACCCTCG GATGCCTTGC GGCGAGGTGC ATTTGGCCTG GTTCTGGCCT TGCCGCTGCT TGCTGCTCCT GCGCGCGCTC AGACACCCGA GCAGGACACT CAGCCCCCAG CCAGGGAGAT CTTGGTTAAC GATGATGTGC AGGTGGAGGA GGTGGCCACT CCTGAAGGCG TCGTCGAGGA AAGCCTGGAG GTCGAGCAGA TTCAGGTCAC CCCTGCTGAG CAGCAAGAGC TGCCTCTCAA CGGATCTGAA CAGCTGCCTG AGGAGCCTCG TGTCTTGATC ACCGAGGTGA TCATCGAAGG GATTTCAGGC CATCCGGAGC AAGAGCGTGT TGAACTCGCC GCTTATGACG CCATGGTTGT GCGTCCTGGT AGTCGTGTGA CTCGAGATGA GCTCAAACGT GATCTCGATG CGATTTACTC CACGGGCTGG TTTTCTGATG TGCGCATTGA GCCGATTGAC GGCCCTTTAG GGGTTCAGCT CGTGGTGCAG GTGCAGCCCA ATCCATTGCT CACCAAAGTG GAACTTGATC CACCTGATGT TGAGCTTTCT GAATCTGTGA TTGAGGAAAC ATTCAGCCCC GACTACGGCA GGACTCTCAA CCTCAATGAA CTGCAGGCTC GTATGAAGGA GTTGCAGCGG TGGTACGCCA ACGAGGGCTA TTCCCTGGCT CGGGTCACCG GGCCCACGCG CGTTTCTCCA GAGGGAGTGG TGCAGCTCAA GGTGATCCAG GGCACGGTGG CAGGGGTTGA AGTGCAGTTC CTTAACAAGG AAGGTGATTC CACCGATGAG AAGGGTGAGC CGATTAATGG CAAGACCAAG CCCTGGGTGA TTACACGAGA GATATCAATC AAGCCTGGCG AAGTGTTTAA TCGAAATCAG CTTGAGGCTG ATATTAAGCG TCTCTATGGG ACATCTCTTT TTAGTGATGT CAAAGTCACC TTGAAACCGG TTGCTGGTGA ACCAGGCAAT GTGACAATCA TTCTTGGCAT TGTTGAGCAG TCCACCGGTT CGTTGTCAGG TGGTCTGGGT TATAGCCAGA GTCAGGGTGT TTTTGGCCAG ATACAACTCC AGGACAGCAA TCTTTTGGGT CGTGCCTGGA ATATGGCATT GAACATTACT TATGGTCAGT ATGGAGGCTT GGGAAGTATT ACATTTACCG ATCCATGGAT TAAGGGTGAT GCTCACCGCA CCTCATTCCG TACCTCTTTA TTCCTCAGTC GTGAGGTTCC GCAAGTTTTT CAGAGCCAAA ATAATGGCAA TATTCGCACC GTTAAAGATT ACTATGATGG CAATTCAAGC TACGCTTATC AAATTAATAA GACTGATAAC CCTGCTGGTC GCAAATTTGA TTCCGTTTCA AAGGCTGAGA GTGAATATCC ACAGTACAGC TGGTTTGACT ATGAGGGTAA CTCTGTCGCG TTGCAAAGAA TTGGCGGCAA TATAGTTTTT GCAAGGCCTC TAAATGGAGG TGACCCTTAT AAGAAGGCGC CATGGAATGT CTTGGCTGGT TTGAATATTC AAAAGGTTCG CCCGATCAAC TTCTCCGGTG ATAGCCGTCC TTATGGTGTC GCAAGTGATG ACATTAAGCA TGGCCAAGTG CCTGATGATG ATGTGATTTG TATCGCGTTT GATTGTGCAG ATGAAAACAA TCTTCTTGGT GTAAGAGTTG CGGCTACTTA TAATAATTTG AATGATCCTC GTAATCCAAC TTCTGGCAAC TTCTTTAGTT TCGGCACTGA GCAATTTGTA TCTGTTGGGG AGCATTCACC GACCTTTAAC CGTCTTCGGA CTAGCTATAC ACATTTTATT CCAGTTAATT GGTTGAAACT TGCCAAGGGC TGCCGTCCTA AGCCAGGGGA ACCGGAAAAT TGTCCTCAGG CCCTTGCTTT TCAGGTTAAG GCAGGCACAG TTTTAGGCGA TCTGCCTCCT TATGAAGCCT TCTGTCTTGG TGGATCTAAT TCCGTAAGAG GTTGGAGTGA TTGTGATTTA TCTGTGGGAC GAAGCTTTGT TGAAGCGACG ATTGAATATC GATTCCCAAT TTGGAATATC GTCTCAGGTG AGGTTTTCGT TGATGGCGGT ACTGATTTAG GTTCCCAAGA GAATGTTCCT GGTAAGCCAG GCAAACTTCT GGACAAGCCT GGTTCTGGTT TTTCGATCGG TACTGGTCTG ATTGTTACCA CTCCAGTAGG GCCATTACGC CTTGAGGTGG CAACACAGGA TTTCACCGAT GAGTGGCGCT TTAATCTCGG GGTTGGCTGG AAGTTCTAG
|
Protein sequence | MASFLSSGPS DALRRGAFGL VLALPLLAAP ARAQTPEQDT QPPAREILVN DDVQVEEVAT PEGVVEESLE VEQIQVTPAE QQELPLNGSE QLPEEPRVLI TEVIIEGISG HPEQERVELA AYDAMVVRPG SRVTRDELKR DLDAIYSTGW FSDVRIEPID GPLGVQLVVQ VQPNPLLTKV ELDPPDVELS ESVIEETFSP DYGRTLNLNE LQARMKELQR WYANEGYSLA RVTGPTRVSP EGVVQLKVIQ GTVAGVEVQF LNKEGDSTDE KGEPINGKTK PWVITREISI KPGEVFNRNQ LEADIKRLYG TSLFSDVKVT LKPVAGEPGN VTIILGIVEQ STGSLSGGLG YSQSQGVFGQ IQLQDSNLLG RAWNMALNIT YGQYGGLGSI TFTDPWIKGD AHRTSFRTSL FLSREVPQVF QSQNNGNIRT VKDYYDGNSS YAYQINKTDN PAGRKFDSVS KAESEYPQYS WFDYEGNSVA LQRIGGNIVF ARPLNGGDPY KKAPWNVLAG LNIQKVRPIN FSGDSRPYGV ASDDIKHGQV PDDDVICIAF DCADENNLLG VRVAATYNNL NDPRNPTSGN FFSFGTEQFV SVGEHSPTFN RLRTSYTHFI PVNWLKLAKG CRPKPGEPEN CPQALAFQVK AGTVLGDLPP YEAFCLGGSN SVRGWSDCDL SVGRSFVEAT IEYRFPIWNI VSGEVFVDGG TDLGSQENVP GKPGKLLDKP GSGFSIGTGL IVTTPVGPLR LEVATQDFTD EWRFNLGVGW KF
|
| |