Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_01501 |
Symbol | sms |
ID | 4716834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 140451 |
End bp | 141803 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640077849 |
Product | DNA repair protein RadA |
Protein accession | YP_001008545 |
Protein GI | 123967687 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.309187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGCA AATTATCTAC TTTTATTTGT CAGAATTGCG GATCTGAAAC TTCACAATAT TTTGGTAGAT GCTTAAATTG CAACTCATGG AATTCCATTG TTGAAGAAAT AAAAAGTAAG AGGTCTAAAT ATCAAGAAAT AAAAAATAGT AAACAATCTA TACCATTTAA CGAGATTTCA TCAAAAAAAA TATCAAGATT TACAAGTGGT TTTAGGGAAT TTGATCGAGT GCTTGGAGGT GGAATCGTAC CTGGATCTGT TGTTTTACTT GGAGGAGAAC CAGGCATAGG TAAAAGCACA ATAGTTCTTC AATCAGCAGG AAAAATATCT CTCAATGAGA AAGTTTTATA TGTAACTGCA GAAGAATCTT TAGAACAAGT AAAAATTAGG TGGGAAAGAT TAAATCAAAG TAGTATTGAT TTAAAAATTT TTGCAGAAAC CAACTTATCC CTAATTATTG AAGAAATCAA ACGGTTAAAT CCAAGTTTCG CAATTATTGA TAGTATTCAA GCCATCCATA ATCATGAAAT GGAAAGTTCG CCAGGATCGG TCTCTCAAGT AAGAGCATGT TCATCTGAAT TGCAAAATCT CGCCAAAGAC AATAACATTG CCCTTTTAAT CATTGGTCAT GTCACCAAAG ATGGTGCTTT AGCTGGCCCT AAAACTCTGG AGCACTTAGT TGATACAGTA ATAAACTTTG AAGGAGATAA TATTTCCTCA CATAGATTAT TAAGAAGTAT AAAAAATCGA TTTGGATCAA CCTTTGAAAT TGGAATTTTT GAAATGCTTG AACAGGGCTT ACGAGAGATA AAAAATCCAA GTTCAATTTT TACAAATAAA GAAAATATTT CAGGTGTAAC AACTACTATT ACAAATGAAG GTACTAGACC ATTAGCAGTT GATATACAAG CACTTGTAAA TAAAACTTTC TACAGTAACC CAAGACGAAC TACAACTGGA ATTAGCATAA ATAGATTACA TCAAATTCTA GCTGTTATTG AAAAACACGT AGGCATAAAA TTATCTGAAT TTGATTGTTA TATAGCTACT GGCGGGGGTT TTGAGATTAA TGATCCTTCA TCTGACTTAG GTGTAGCAAT ATCAATTTTA TCAAGTTTGA AAAATATTCC TCCTTTAGTA AATAGCTCAT TTATTGGGGA ATTGGGATTG AGCGGTCAGG TTAGAAAATC TAATAACCTT CGAACAAAGA TAGAAGAAGC TATAAGACTA GGTATCAAAA ATATCGTAGT ACCAAAATTA GAGGAGGAAA TAAATAATAA TTTTCAAAAT CTAATAAATA TCAAAGAAAT TTCCAATATT AAAGAAGCAG TTGACTATTC TTTATCAGAG TAA
|
Protein sequence | MSSKLSTFIC QNCGSETSQY FGRCLNCNSW NSIVEEIKSK RSKYQEIKNS KQSIPFNEIS SKKISRFTSG FREFDRVLGG GIVPGSVVLL GGEPGIGKST IVLQSAGKIS LNEKVLYVTA EESLEQVKIR WERLNQSSID LKIFAETNLS LIIEEIKRLN PSFAIIDSIQ AIHNHEMESS PGSVSQVRAC SSELQNLAKD NNIALLIIGH VTKDGALAGP KTLEHLVDTV INFEGDNISS HRLLRSIKNR FGSTFEIGIF EMLEQGLREI KNPSSIFTNK ENISGVTTTI TNEGTRPLAV DIQALVNKTF YSNPRRTTTG ISINRLHQIL AVIEKHVGIK LSEFDCYIAT GGGFEINDPS SDLGVAISIL SSLKNIPPLV NSSFIGELGL SGQVRKSNNL RTKIEEAIRL GIKNIVVPKL EEEINNNFQN LINIKEISNI KEAVDYSLSE
|
| |