Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_20571 |
Symbol | |
ID | 4776586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1811084 |
End bp | 1813426 |
Gene Length | 2343 bp |
Protein Length | 780 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640087566 |
Product | SAM-binding motif-containing protein |
Protein accession | YP_001018058 |
Protein GI | 124023751 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2230] Cyclopropane fatty acid synthase and related methyltransferases [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.183834 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGGAT TTGGAGAAAA GAAGAAAATA AAAAATAAAC AACCCTCCAA GAAGTTTGCC CGAATACCAC CTGATCAACT CAAGGCTATC GCTTTTAAAT ATCACCAACA AGGCAATATA AATGAGGCGC AAAAAGCCTA TCAGGAATTT ATTAATAGTG GTTTAAGTGA TCCAGATGTT TTCTCTAACT TTGCATTAAT CTGTCAATCC CAAGGAGAGA TTGACAAGGC AATAAAAGTC TATAAGAAAA GCGTCAAACT ATTTCCTGGC CATGCTTTTT CACATGCAAA CTTGGGATAT CTTCTCTTTC AGATAGGAAT GTTAGATGAT GCTGAAGTGG CTATCCGCCA GGCGATTGTT ATACAGCCTA ACCTCGCCAA TGCTTATTCA TACTTGGGTT TAGTTTTACG AGAAAAAGGA AGACTAACTG ATGCAGAAGA TATAACCCGA AAGGCCATTG AACTTCAACC AGATTTGGCT GATGCATACG TAAACCTTGG TCAGATCCTA CAGAATCAAG GCAAACTAGA TGAAGCAGAA CATACAACAC GCAAGGCAAT TGAATTACAA GATGATTCGG CAAGTATATA TCTCAATCTA GGTGGTATCC TGCAAGACCA AGGTAATCTA ACTGATGCAG AAGCGAACAC AAGAAAGGCA ATGAACTTAC AGGCCGATCT GCCTGATGTC AATTTAAACC TTTCTATAAT TTTAAAGGAT CTTGGCAGAT TAGAGGAGGC TGTATTTCAT TTAACAAGAG AAATTGAGCT TTACCCACAA AACCAATCAT CATATTTGCT TCTTAACTCA CTGCTTGAGG AATCTGATCT TTCATTCTTA CCTGAAAGAC AGTCTAGAAT TTTGCTTCGT GGCCTATTAA AGAGAAATGA TATTGCACAT AAAAATCTAT TTTCAGCAAT TAATCGCTTG ATCTCAGAAC AAACACTAGA CAAAATTTCA GGCATTCATC ATGATTTATT TGATGACCCC TCTTTTCAAC AAATTCTTGC TGATGATGAG ATAATCAGTG CCCTTGGTTT GATGTTATTT ACCACAATGG CCTGGGAGAA GGCATTGACT AATATACGCA AGCAAATATG CCTTTCAATT CAAAATAATG GTTTTGATAA AAGGATTATT GATTTAACTA TTGCTCTTGC GGAGCAGTGT TTCCTAAATG AATATGTTTT TACATTTACC AAGCAAGAAT CTGATGCAAT AGAACAGTTT AAACTTTCCT ACTTGAGAAG TGATTTTGAT TTAAAAACTC TTTCTATCTT AGCTTGTTAT ATACCCATTA CACATCTTTC TGAACAGTTC CCTTTATTAA GAGGATTCAT CGATGAAAAT GAGAAATTAA ATAATTTAAA AATTATGCAA CTCGTCGAAC CCGAACGTGA ACATGATTTT GCAGTATCTA TTCCTAAATA TGGATCTATA GATGATGGTA CTTCTATTCA GGTTAAAAAG CAGTATGAAG AGCATCCTTA TCCGCGTTGG AGATATGCTT CTTATTCAAG TGAAAATATT CAAACTATAT CTTCAGCGAT TAATAATGAA ATAAACCCTA ATCGGATCTC AATCATTTTG CCCAAGCAGA GATCTCGCGT CTTAATAGCT GGCTGTGGAA CCGGACAGCA AATATTCGAT GCCCTCTCTT ATTCTAACTC TGATTTAACT GCTATCGATC TCAGTTCTTC TAGTATCGCT TATGCAAAGC GTAAGGCTCA TGAATATGGA ATTGAGCATA TAAGATTTAT AGAGATGGAT ATTCTTGATC TTCCAAAACT AAATGAAGAA TTTGATCTCA TAGAATGTAC GGGAGTTCTC CATCATATGA AGGACCCATC TGAAGGTCTG CAATCTCTAC TTACAATACT CGCATCCGAT GGAATGCTCA AATTAGGTTT TTATAGTGAG CTAGCACGTC AAGATATTGT CGAGGCTAGG AAAATTATTA AATCAGAATC TTTTGAGGCT AGTAATGAAG GCATTTGTCT ATTTAGGGAT AAACTAATCA ACGGTGAATA TCCGAACATT AGTTCAATCT CAAACTGGCC AGACTTCTAC ACAACTTCTA TGTGTAGAGA CCTTTGTTTT CATATTATGG AGCATCGTTA CTCACTAGTG ATGATAGCAT CGCTGCTTGA ACAATTTGAA CTTAGATTCT TAGGTTTTGT GTTGCCAAGT ATTGTTAAAA AAGATTATGC CAGGGCTTAT CCATCTGATG CAATGCAGAC TGACCTCGGC TATTGGCACG AGTATGAACA AGCCAATCCC AATACCTTTC GACAAATGTA CCAGTTCTGG ACGAATAGAA AGAAGATAAT GTATTACGGT TAA
|
Protein sequence | MKGFGEKKKI KNKQPSKKFA RIPPDQLKAI AFKYHQQGNI NEAQKAYQEF INSGLSDPDV FSNFALICQS QGEIDKAIKV YKKSVKLFPG HAFSHANLGY LLFQIGMLDD AEVAIRQAIV IQPNLANAYS YLGLVLREKG RLTDAEDITR KAIELQPDLA DAYVNLGQIL QNQGKLDEAE HTTRKAIELQ DDSASIYLNL GGILQDQGNL TDAEANTRKA MNLQADLPDV NLNLSIILKD LGRLEEAVFH LTREIELYPQ NQSSYLLLNS LLEESDLSFL PERQSRILLR GLLKRNDIAH KNLFSAINRL ISEQTLDKIS GIHHDLFDDP SFQQILADDE IISALGLMLF TTMAWEKALT NIRKQICLSI QNNGFDKRII DLTIALAEQC FLNEYVFTFT KQESDAIEQF KLSYLRSDFD LKTLSILACY IPITHLSEQF PLLRGFIDEN EKLNNLKIMQ LVEPEREHDF AVSIPKYGSI DDGTSIQVKK QYEEHPYPRW RYASYSSENI QTISSAINNE INPNRISIIL PKQRSRVLIA GCGTGQQIFD ALSYSNSDLT AIDLSSSSIA YAKRKAHEYG IEHIRFIEMD ILDLPKLNEE FDLIECTGVL HHMKDPSEGL QSLLTILASD GMLKLGFYSE LARQDIVEAR KIIKSESFEA SNEGICLFRD KLINGEYPNI SSISNWPDFY TTSMCRDLCF HIMEHRYSLV MIASLLEQFE LRFLGFVLPS IVKKDYARAY PSDAMQTDLG YWHEYEQANP NTFRQMYQFW TNRKKIMYYG
|
| |