Gene Information |
Locus tag | P9303_01331 |
Symbol | |
ID | 4776391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 144261 |
End bp | 147179 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640085632 |
Product | hypothetical protein |
Protein accession | YP_001016153 |
Protein GI | 124021846 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
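The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions match the values listed here. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(144261, 147179)
print(length)                     # 2919
print(protein_length_aa(length))  # 972
```

The results agree with the Gene Length (2919 bp) and Protein Length (972 aa) fields.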
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCCAA GCTTATTGCT CCGCAATCAG ATAGCTGAGC ACAAAAGGCC TGCTTATGAC TATTTTGAAG ATATAGATAT AGTTGATCTC CAGGATAAAA CCCTTGTTAA ATTAGATCGT CATTGGAGTA GTTATAAGGC TTCATGGCCT CTTGCCGATA TAAATGAAGG CTTCTCAAGT AATAGCCATG TTCGTGACTT TGAAGAAAGC TTGGTTTTAT TGTTGGAGGC TCGAATGGCT TTTTGTCAGT CAGGCGATGA TTCTTCTATG TGGGTTGTTA AGGATCCTCG TACTCCCATA CTTCTTGCAA GTTGGTTACG CGTCATGGAA TGGCTCCAAA TTGAGCCTGT ATTTATAATT GTCCATCGTA ACGCTTCTGA CAATATCCAA TCTTTCAGCA AAAAGGGACA GGTCCCACAG CGTTGGGCAG AGGCTCTTTG GCAGCAAAGT TATGTGCAGA TTGGGAAATC CATCCCTTCT GCATCTTCTG TGTATTCTCT TGATTTCTCA CAGATATTAG AGAATCCTCT GCATGTTGCT CAAGAACTTA AAAACTTCCT AGGCATGCAT ACAGATCAAC CATTAGTTCC AACTTTAAAG AAAGCAGTAG ATCCCTCCCT CCCCTCTAAG CATTCAGAAT ACAAGCTTAG TTCTATTAGT GAAAAGATCG AGTTGTGCTT AAAACAGTCT CGCTTTTCGG ATTTACCTTC ACCTGATGAT ATTACGCTCG AGGCTTATAA AATACAAGCT GATTTGACCC CAGTTTCACA ACTAACTCTA CATAATTTTG GCATAGAACT TCGAAAAAAT AAAGCTATGT CTCACTCCAG TCGTAAGCGA ATTTGTATTC TCACAGCAGA GCTCCAGGGA TATGGTCCGA GTGGAGGGAT TGGAACGGCA ATGTTGGAGT TGGCGATAGA GCTTGTCAGT TCTGGACACC TAGTTGAGGT TTGGTTGGTT GGTAGTAGTA ATGATCCCAT ACCCTCAAGT CGACTTGACT CTATTCATAT AAGGCATTTA CCTGGAGAAA CTGTAGATCA AGACCCTGCT AACTTTCGGC AACAAATAGC AGAGGCTGTA TTAACGGAAT CTTTCGATAT TATTCATTGT CATGATTGGT TAGGACTTGG TGCTTGCTTG CATTTATCAA CCCTTGAAAC CGAATCCCCA ATTGTAATAT GTGGCTTGCA TGGCCCAACT CAGTGGGTAA GAGAAGGAAG TCCTTCGATT TCCAATTGGA CCAAAAGAGA TTCAACAATC ATTGAACTGG AATGGCAGGC GATCATCAAT GCAGATGTAC TGTTTAGTCC CTCTGCATAT ATGAAAAACT GGGTATCAAA ACACCTTGGA AATAGAAAAT ATTATCCAGA TATTCATGTG CAGTTGAATT GTCCTAGCGT TGCCCCAAAT GCAAAACTAA GTCGTAATAT TGATCTTCCT CCAGAAAGCA AGAGTAGTTT GATCTTCTTT GGTCGATTAG AGGAACGCAA AGGAATAGTT TTGTTTTTAG ATGCTCTATC AATTCTTGGC CTGCAGACCC ATCCAATTTA TTTTATAGGT GCTGACGCAC CTTTAGATGG TTGTTGGGCG AGTCAATTAG TAGAGAGACG ACTAAAAGAA AGCGGCCAGC TCTATCAGTG GCTGCCTGAT TTGAATCGCG ATGAAGCCCA TGCTGTACTA CATGCAATAG GCGGAATTGT TGTAATTCCC TCTTTGATTG AAAATAGCCC TTACACCGTT CAAGAATTGC TAGATACAAC ATTGAGTGTA GTTACAACAA ATGTAGGTGG AACCCCGGAA 
CTGGTAGCTA ATGCTCAAGC AACTTTGTCA GAGCCCAACC CTCGCGACTT GGCTAACAAG ATTCAAGATG CTCTGGCGGA CAACATTGAT TCAAAATCTA TTTTTAAAAT CAAATCAATT GTTGATAAAT CGAGGATTCG TTTGAGTTGG CAAGAGTTTC ACTCTCGTTT GCCATCTTGT GAATTTACTG TTCCAAGATG GAACCCTAAA GAAGCCATTG TGCTTATTAC TTTAGATTCA TGTCGCCTTG ATACTTTTCA GTCTAGCTCT ACCGTTAACA TTAGCAAGAT CGGGCCTTTG CATAAGGCGA AATCCCCAAG CTATTTTACT TATGCAAGCC ATGCAGCAAT GTTTATGGGT TTCTTGCCTA GCACCTTAGA TCCAGTTGGC TTCGTTAACT CTAAATTTGC CAAAGTATTT CGGCTTTCTC ACTCAGGTTT CCAGGCTTCA CGCACAGAAG AAAGCTTTGA ACTTTCTGGT AATTCGATTA TTACTGGCCT GAGGCGAAAA GGTTATTTCA CTATTGGAAC TGCATCCGTT AATTGGTTTG ACCCAGCTAC AGAAACTGGA CAACAGCTTG TTAAAGATTT TGATACTTTC TGGTTTTCTG GCAACACCTG GAGCTTGAAC CGTCAGTTAT TATGGATTGA TAGTCAGTTG CAACAAGAGC TTGATCGCCC ACCTTTTATT TTTTTGAACG TTGGCGAAAC TCATGTGCCT TATTGGCATG AAGGTGCTTC GTGGTCTAGG GACGACCACC CATGCATACC ATTTCAGACA CAGGATCGTC GTAAAGATTG CCAGGAACGT CAGCGTGCTT GCCTCGAGTT CATCGACAAA CAACTTGGTT CTCTTCTTGA ACGTTTTAGT GAATCAACCG TAATTATTTG TTCGGATCAT GGTGATTGCT GGGGGGAGGA TGGCCTATGG GAGCATGGTA TTTCCCATGA AAAAACTCTT TCTGTTCCAT TATTGATGAG AATACGGGGC TGCCCGATCC CACCTCCGAC ACCTCCGCTT AGTTTTCGAC AACGTATTTC AAATTTTGCC AGGAAACGTA TTTCAAAGCC TGTCAGGTGC AAGCTAGCCT CTGCTCTTAG AAGAACTAAA GTCCTTTGA
|
Protein sequence | MGPSLLLRNQ IAEHKRPAYD YFEDIDIVDL QDKTLVKLDR HWSSYKASWP LADINEGFSS NSHVRDFEES LVLLLEARMA FCQSGDDSSM WVVKDPRTPI LLASWLRVME WLQIEPVFII VHRNASDNIQ SFSKKGQVPQ RWAEALWQQS YVQIGKSIPS ASSVYSLDFS QILENPLHVA QELKNFLGMH TDQPLVPTLK KAVDPSLPSK HSEYKLSSIS EKIELCLKQS RFSDLPSPDD ITLEAYKIQA DLTPVSQLTL HNFGIELRKN KAMSHSSRKR ICILTAELQG YGPSGGIGTA MLELAIELVS SGHLVEVWLV GSSNDPIPSS RLDSIHIRHL PGETVDQDPA NFRQQIAEAV LTESFDIIHC HDWLGLGACL HLSTLETESP IVICGLHGPT QWVREGSPSI SNWTKRDSTI IELEWQAIIN ADVLFSPSAY MKNWVSKHLG NRKYYPDIHV QLNCPSVAPN AKLSRNIDLP PESKSSLIFF GRLEERKGIV LFLDALSILG LQTHPIYFIG ADAPLDGCWA SQLVERRLKE SGQLYQWLPD LNRDEAHAVL HAIGGIVVIP SLIENSPYTV QELLDTTLSV VTTNVGGTPE LVANAQATLS EPNPRDLANK IQDALADNID SKSIFKIKSI VDKSRIRLSW QEFHSRLPSC EFTVPRWNPK EAIVLITLDS CRLDTFQSSS TVNISKIGPL HKAKSPSYFT YASHAAMFMG FLPSTLDPVG FVNSKFAKVF RLSHSGFQAS RTEESFELSG NSIITGLRRK GYFTIGTASV NWFDPATETG QQLVKDFDTF WFSGNTWSLN RQLLWIDSQL QQELDRPPFI FLNVGETHVP YWHEGASWSR DDHPCIPFQT QDRRKDCQER QRACLEFIDK QLGSLLERFS ESTVIICSDH GDCWGEDGLW EHGISHEKTL SVPLLMRIRG CPIPPPTPPL SFRQRISNFA RKRISKPVRC KLASALRRTK VL
|
| |
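The sequences above are displayed in blocks of 10 characters separated by spaces, so they need light cleaning before use. The sketch below is a hedged illustration only. The helper names are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11, which is the table named in this record. It strips the spacing, computes a GC fraction, and checks that a string has the basic shape of a complete CDS. It accepts only ATG as a start codon, a simplification that fits this gene, although table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq starts with ATG, ends with a stop codon, and has whole codons."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# The first two blocks of the gene sequence, used here as a short fragment.
fragment = clean_sequence("ATGGGGCCAA GCTTATTGCT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.5
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content field of the record.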