Gene Information |
Locus tag | P9211_01471 |
Symbol | sms |
ID | 5730820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 143554 |
End bp | 144939 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641284491 |
Product | DNA repair protein RadA |
Protein accession | YP_001550032 |
Protein GI | 159902688 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
|
|
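The coordinate and length fields above are consistent with one another, which a short sketch can check. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
# Values copied from the record above.
START_BP = 143554
END_BP = 144939


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1386 461
```

Both results match the Gene Length (1386 bp) and Protein Length (461 aa) fields. The strand does not affect this arithmetic.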
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAACGAA CAAGCACTAT TTATATCTGC CAGTCATGTG GGGCAGAAAC AAGTCAATTT TTTGGAAGAT GCCCAAGCTG TGGGGAATGG AATTCAATTA TTGAAGAAGC CATATCGCCG TCGGGATTAA AAGCTAGAAA AGGGAGGTCA ACTCTCTCAA AAATCTCTTC TGGCAGCCGA TCTGAAACAA TTTCATCAAT CAAAGATACC CCTATCGACA GAATAAAAAG CGGCTATATC GAATTTGACC GAGTTCTCGG GGGAGGATTA GTTCCTGGAT CTCTTGTACT AATCGGAGGA GATCCAGGAA TTGGGAAAAG CACATTGCTT TTACAAACAG CAACAGAAAT TGCTCTAAAT AACAGTGTGC TCTACATAAC AGCTGAAGAA TCAGCAAGTC AAGTTCAACT GCGCTGGCAT CGTTTAAATA AAGTTGAATC CAATCTTCAT ATTCTTGCTG AAACGGATTT AGAACTAATT CTTGATGAAT TAGAAAAGCT AAACCCTGAG GTGGCAATTA TAGACAGCAT TCAAGCTTTA CATGATGCAA CTATTTCAAG CACTCCAGGA TCAATTACCC AAGTAAGAGA ATGTGCAGCA GCCTTGCAAC AAGTTTCTAA AAAGAAAAAC ATCGCACTAT TAATTGTTGG TCATGTGACA AAAGAAGGGA TGCTTGCTGG ACCAAAGGTT CTTGAACACC TTGTTGATGC AGTAATGACA TTTGAAGGTG ATCGTTTCGC AAGTCACAGA CTTCTAAGAG CAGTAAAAAA TAGATTCGGA GCCACCAATG AACTTGGTGT GTTTGAAATG CAAAGCACGG GTCTTGTAGA AGTGACAAAT CCCAGTGAAC TATTTCTCAG TTCGGAAGAA ACCTCTGGAG TTGCAACAAT TGTTGCGTGT GAAGGAACAA GGCCCTTGGC AATAGATATA CAAGCACTCA TTAACCAAAC CACTTACGCA TCTCCTAGGC GAACAGTGAC TGGTATAGGC AGCAATCGTC TTCATCAAAT ACTGGCTGTT TTAGAAAAGC ACATAAATTT GGCTCTATCC CGTTTTGATT GCTATTTAGC AGTAGCAGGG GGGTTAGAAG TAGACGAACC AGCAGCAGAT CTCGGAATTG CAGCCGCAAT TGTCTCAAGT TTCAAAAACC TAAAAATTCC TAAAAACACA GTTCTTCTAG GCGAAATTGG ACTCGGAGGA CAGCTTCGGA CTGTTGGCCA AATACCATTG AGGCTAAAAG AGGCAGAAAA ATTAGGTTTT AAACAAGCAG TTGTCCCTAG CTCTACTGGC ATTGATAAAG AAAACAATGA ACTAACACTT GAAATATTTG AAGCTTCAAC TATTAGCGAA GCCATACAAA TTATTCTTGA TCTTAAAAGT CAATAA
|
Protein sequence | MKRTSTIYIC QSCGAETSQF FGRCPSCGEW NSIIEEAISP SGLKARKGRS TLSKISSGSR SETISSIKDT PIDRIKSGYI EFDRVLGGGL VPGSLVLIGG DPGIGKSTLL LQTATEIALN NSVLYITAEE SASQVQLRWH RLNKVESNLH ILAETDLELI LDELEKLNPE VAIIDSIQAL HDATISSTPG SITQVRECAA ALQQVSKKKN IALLIVGHVT KEGMLAGPKV LEHLVDAVMT FEGDRFASHR LLRAVKNRFG ATNELGVFEM QSTGLVEVTN PSELFLSSEE TSGVATIVAC EGTRPLAIDI QALINQTTYA SPRRTVTGIG SNRLHQILAV LEKHINLALS RFDCYLAVAG GLEVDEPAAD LGIAAAIVSS FKNLKIPKNT VLLGEIGLGG QLRTVGQIPL RLKEAEKLGF KQAVVPSSTG IDKENNELTL EIFEASTISE AIQIILDLKS Q
|
| |
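The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It assumes the gene sequence is given 5' to 3' on the coding strand, so no reverse complement is needed. It uses the standard codon assignments, which translation table 11 shares, and reads an initiating GTG as methionine. The helper names are hypothetical, and only three common start codons are listed even though table 11 permits others.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(dna: str) -> str:
    """Remove the spaces between the 10-base groups and upper-case the sequence."""
    return "".join(dna.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as grouped in the record.
first_bases = "GTGAAACGAA CAAGCACTAT TTATATCTGC"
print(translate(first_bases))  # prints: MKRTSTIYIC
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.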