Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_01611 |
Symbol | sms |
ID | 4720297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 148021 |
End bp | 149373 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 640079824 |
Product | DNA repair protein RadA |
Protein accession | YP_001010477 |
Protein GI | 123965396 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.821199 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGTA AATTTTCGAC TTTTATTTGT CAAAATTGCG GATCTGAAAC TTCTCAATAC TTTGGTAGGT GCGTAAATTG TAATGAATGG AATACAATCG TTGAAGAAAG GAAAATTTCA AGATCCAAAA CAACAAATAT TACCAAAAGT AAAAAATCAA AGCTATTTCA TGAAATTGAA CTAAATACAA TATCAAGATT TACGAGTGGT TTTAAGGAAT TTGACAGAGT TCTAGGGGGT GGAATAGTAC CAGGATCTTT AGTTTTACTT GGGGGAGAAC CAGGAATTGG TAAAAGTACA ATAGTGCTGC AATCTGCAGG GATAATCTCT CTGAATGAAA AAGTTTTATA CATCACCGCA GAGGAATCTT TAGAACAAGT AAAAATAAGA TGGGAACGAT TAAATCAAAG CAGCCTAGAT TTAAAAATCT ATGCAGAAAC AAACTTATCT TTAATTATTG AAGAGATAAA AAAAATTAAG CCTAGTTTTG CGATTATTGA TAGTATCCAA GCTATTAATA ATGATGAAAT GGAAAGTTCC CCGGGCTCTG TTTCCCAGGT TCGGGCTTGC TCATCTGAAC TTCAAAACCT GGCAAAAGAA AATAATATAG CGCTCTTGAT AATTGGTCAT GTCACTAAAG AGGGGGCTCT TGCAGGTCCA AAAACTTTAG AACATTTAGT AGATGTTGTT CTTAACTTTG AGGGAGATAA TATTGCCTCA CATAGATTAC TCAGAAGTGT AAAAAATAGG TTTGGATCTA CTTTTGAAAT TGGAATTTTC GAAATGTTAG AAAATGGGTT AAAAGAAGTT GGGAACCCTA GTTCAATTTT CACAAATAAA GAAAATATAT CCGGCGTCAC AACTACGATA ACTAATGAGG GATCTAGACC ATTTGCAGTA GATATTCAAG CTTTGGTGAA TAAAACTTTT TACACTAACC CAAGACGAAC AACTACAGGA ATCAGTATCA ACAGACTACA TCAAATATTA GCAGTCATTG AAAAACATGT TGGGATTAAA TTATCAGATT ATGACTGTTA TATAGCAACA GGAGGCGGGT TTGAAATAAA TGACCCATCT TCAGATTTGG GGGTTGCAAT ATCAATCTTA TCAAGCTTGA AAAATATTCC CCCCTTAATA AATTGTTCAT TTGTAGGCGA ATTAGGTTTA AGTGGTCAGG TAAGGCAGGC AAATAATCTT CGAACCAAAA TTGATGAAGC TATTAGACTT GGTTTCAAAA ATATTTTGAT ACCTAAAACA ACTTGTGAAA TTAAAGATAA TTTTCAAACT CTCATAAAGA TTAAAGAAAT TTCAAATATT AATGAAGCTA TGAATTATGT TCTAAAAGAG TGA
|
Protein sequence | MSSKFSTFIC QNCGSETSQY FGRCVNCNEW NTIVEERKIS RSKTTNITKS KKSKLFHEIE LNTISRFTSG FKEFDRVLGG GIVPGSLVLL GGEPGIGKST IVLQSAGIIS LNEKVLYITA EESLEQVKIR WERLNQSSLD LKIYAETNLS LIIEEIKKIK PSFAIIDSIQ AINNDEMESS PGSVSQVRAC SSELQNLAKE NNIALLIIGH VTKEGALAGP KTLEHLVDVV LNFEGDNIAS HRLLRSVKNR FGSTFEIGIF EMLENGLKEV GNPSSIFTNK ENISGVTTTI TNEGSRPFAV DIQALVNKTF YTNPRRTTTG ISINRLHQIL AVIEKHVGIK LSDYDCYIAT GGGFEINDPS SDLGVAISIL SSLKNIPPLI NCSFVGELGL SGQVRQANNL RTKIDEAIRL GFKNILIPKT TCEIKDNFQT LIKIKEISNI NEAMNYVLKE
|
| |