Gene Information |
Locus tag | P9301_01521 |
Symbol | sms |
ID | 4912700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 141429 |
End bp | 142781 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640159718 |
Product | DNA repair protein RadA |
Protein accession | YP_001090376 |
Protein GI | 126695490 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
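The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this, but the numbers agree with it) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(141429, 142781)
print(length)                     # 1353
print(protein_length_aa(length))  # 450
```

Both values match the Gene Length and Protein Length fields of the record.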
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.268386 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGCA AATTATCAAC TTTTATTTGT CAGAATTGCG GATCTGAAAC TTCTCAATAT TTTGGTAGAT GCTTAAATTG CAACTCATGG AATTCCATTG TTGAAGAAAT AAAAAGTAAA AGATCTAATT ATCAAGAAAT TAAAAATAGT AAAAAATCTA TATCATTTAA CGAGATTTCA TCAAAAAAAA TATCAAGATT TACAAGTGGT TTTATGGAAT TTGATCGAGT ACTTGGAGGT GGAATCGTTC CTGGATCTGT TGTTTTACTT GGAGGAGAAC CAGGCATAGG CAAAAGCACA ATAGTTCTTC AATCAGCAGG AAAAATATCT CTAAATGAGA AAGTTTTATA TATAACTGCA GAAGAATCTT TAGAACAAGT AAAAATTAGA TGGGAAAGAT TAAATCAAAA TAGTATTGAT TTAAAAATTT TTGCAGAAAC CAATTTATCC CTAATTATTG AAGAAATCAA ACGTTTAAAT CCAAATTTCG CAATTATTGA TAGTATTCAA GCCATCCACA ATCATGAAAT GCAAAGTGCT CCAGGATCGG TTTCTCAAGT TAGAGCCTGT TCATCTGAAT TACAAAATCT TGCAAAAGAA AATAATATTG CGCTTCTAAT AATTGGTCAT GTAACCAAAG ATGGTGCTTT AGCTGGCCCT AAAACTCTAG AGCATTTAGT TGATACAGTA ATAAACTTTG AAGGAGATAA TATTTCCTCA CATAGATTAC TAAGAAGTAT AAAAAATCGA TTTGGATCTA CCTTCGAGAT TGGAATTTTT GAAATGCTTG ATGAGGGTTT ACGAGAGATA AAAAACCCAA GTTCAATTTT TACAAATAAA GAAAATATTT CAGGTGTAAC AACTACTATT ACAAATGAAG GCACTCGACC ATTAGCAGTT GATATACAAG CACTGGTAAA TAAAACTTTC TACAGTAACC CAAGAAGGAC TACCACTGGA ATAAGTATCA ATAGATTGCA TCAAATCCTA GCTGTAATTG AAAAACATGT AGGGATAAAA TTATCTGAAT TTGATTGTTA CATTGCTACT GGTGGGGGTT TTGAGATTAA TGATCCTTCA TCTGACTTAG GTGTAGCAAT ATCAATTTTA TCCAGTTTGA AAAATATTCC TCCTTTGACT AGTTGCACAT TTATTGGAGA ATTGGGTTTG AGCGGTCAGG TTAGAAAATC TAATAACCTT CGAACAAAGA TAGAAGAAGC TGTAAGACTA GGGATCAAAA ATATCGTAGT GCCAAAACTA GAGGAGGAAC TAAATAATAA TTTTCAAAAT TTAATAAATA TCAAAGAGAT ATCCAATATT AAAGAAGCAG TTGACTATTC TTTATCAGTT TAA
|
Protein sequence | MSSKLSTFIC QNCGSETSQY FGRCLNCNSW NSIVEEIKSK RSNYQEIKNS KKSISFNEIS SKKISRFTSG FMEFDRVLGG GIVPGSVVLL GGEPGIGKST IVLQSAGKIS LNEKVLYITA EESLEQVKIR WERLNQNSID LKIFAETNLS LIIEEIKRLN PNFAIIDSIQ AIHNHEMQSA PGSVSQVRAC SSELQNLAKE NNIALLIIGH VTKDGALAGP KTLEHLVDTV INFEGDNISS HRLLRSIKNR FGSTFEIGIF EMLDEGLREI KNPSSIFTNK ENISGVTTTI TNEGTRPLAV DIQALVNKTF YSNPRRTTTG ISINRLHQIL AVIEKHVGIK LSEFDCYIAT GGGFEINDPS SDLGVAISIL SSLKNIPPLT SCTFIGELGL SGQVRKSNNL RTKIEEAVRL GIKNIVVPKL EEELNNNFQN LINIKEISNI KEAVDYSLSV
|
| |
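The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own method: it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ only in permitted start codons), and it assumes the gene sequence is listed in coding orientation, which is consistent with it beginning with ATG even though the gene lies on the minus strand. The names `translate` and `gc_fraction` are hypothetical helpers.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGTCTAGCA AATTATCAAC TTTTATTTGT"))  # MSSKLSTFIC
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.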