Gene Information |
Locus tag | P9303_26521 |
Symbol | sms |
ID | 4777767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2341342 |
End bp | 2342736 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640088175 |
Product | DNA repair protein RadA |
Protein accession | YP_001018647 |
Protein GI | 124024340 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1066] Predicted ATP-dependent serine protease |
TIGRFAM ID | [TIGR00416] DNA repair protein RadA |
|
|
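The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2341342, 2342736)
print(length_bp)                  # 1395
print(protein_length(length_bp))  # 464
```

Under these assumptions the computed values agree with the Gene Length (1395 bp) and Protein Length (464 aa) fields.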
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGCACTG CTTCCGTCTA CGTCTGTCAG AGCTGTGGCG CTAAGACCAG CCAGTTTTTT GGTCGCTGCG CCAGTTGCGG GACTTGGAAT TCCTTGGTCG AACAGGTCGC TCATTCAACC GATGGTCGCC GCCGTAGTAA CAGTTTTGAT CCGGCGGTGG AACCAGCGGC GCGGCGCTCG ATGGCGATGG CTTCTCTAGG GGATCAACCC GTTTGCCGTC TTGCCAGTGG CTACGACGAA TTGGATCGAG TCCTGGGAGG CGGTCTGGTG CCTGGATCAA TGGTGTTAGT TGGAGGTGAT CCTGGTATTG GCAAGAGCAC GCTGCTGTTG CAGAGCGCCA CGGCTATGGC ACAGCAACAC TCTGTGCTTT ACGTGAGTGC TGAGGAGTCT GCCCAGCAGG TGAAGTTGCG TTGGCTGCGT CTTGAGGGGG TTGTGTCGGA CCTGCAGCTT TTGGCGGAAA CGGACTTGGA ATTGGTGCTT CAGGAATTGG AAGCACTGCG GCCTGCGGTG GCGATTATCG ACAGCATCCA GGCTTTGCAT GATGGGTCCC TTTCCAGTGC GCCTGGCTCT GTCGCTCAGG TGCGCGAGTG TGCAGCAGCT TTACAGCGTT TGGCTAAGCG TCAGGACACA GCCTTGATTT TGGTGGGCCA TGTCACCAAG GAGGGGATGC TGGCGGGCCC CAAGGTGCTT GAGCATTTGG TCGATGCAGT ACTCACTTTT GAGGGCGATC GATTTGCCAG TCATCGCCTT CTGAGGGCTG TAAAAAACCG CTTTGGAGCC ACCCATGAGC TGGGGGTGTT CGAGATGCAG GGCAAGGGCC TAGCTGAAGT AGGGAACCCC AGCGAACTCT TTCTCAAGGG TGAGTCTGCC TCTGGGGTGG CCACCATCGT GGCCTTTGAA GGGACCCGTT CATTGGTGGT GGATCTGCAG GCTTTGGTGG GTGTAACCAG CTATGCCAGT CCACGCCGTA CCGCTACAGG ATTAGGCACC AACCGCTTGC ACCAAATTCT GGCCGTATTG GAGAAGCACA TGGGCTTGCC ATTGTCACGC TATGACTGTT ATCTGGCTGT TGCTGGCGGA TTAGAAGTCG AAGAGCCTGC TGCTGATTTG GGTGTCGCCG CTGCCGTTGT TTCCAGTTAC CGGGATCTCA TGCTGCCAAA AGGCACAGTT TTGCTTGGAG AGTTGGGCTT AGGGGGGCAA TTGCGGCCAG TGGGGCAGCT TGGGCAGCGT CTTAAGGAAG CAGTTCGCTT AGGTTTTCAT CGTGCCGTAG TGCCGCAGGG CAGTGGCCTA GGACCCATCG GCGAGGAGCT GGAGTTGGAA CTACTTGAGG CTGCGAGTGT TACTGAGGCT TTGGTGATGG CCCTCGGTGT GAATCCAGCT GATGATGGCT GCTGA
Protein sequence | MRTASVYVCQ SCGAKTSQFF GRCASCGTWN SLVEQVAHST DGRRRSNSFD PAVEPAARRS MAMASLGDQP VCRLASGYDE LDRVLGGGLV PGSMVLVGGD PGIGKSTLLL QSATAMAQQH SVLYVSAEES AQQVKLRWLR LEGVVSDLQL LAETDLELVL QELEALRPAV AIIDSIQALH DGSLSSAPGS VAQVRECAAA LQRLAKRQDT ALILVGHVTK EGMLAGPKVL EHLVDAVLTF EGDRFASHRL LRAVKNRFGA THELGVFEMQ GKGLAEVGNP SELFLKGESA SGVATIVAFE GTRSLVVDLQ ALVGVTSYAS PRRTATGLGT NRLHQILAVL EKHMGLPLSR YDCYLAVAGG LEVEEPAADL GVAAAVVSSY RDLMLPKGTV LLGELGLGGQ LRPVGQLGQR LKEAVRLGFH RAVVPQGSGL GPIGEELELE LLEAASVTEA LVMALGVNPA DDGC
|
| |
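The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand (so no reverse complement is needed even though the gene lies on the minus strand), that whitespace in the sequence is only formatting, and that the first codon is read as methionine, which is a simplification of how translation table 11 handles alternative start codons such as GTG. The amino acid assignments of table 11 are the same as those of the standard code. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is reported as 'M'
    regardless of its normal meaning (for example GTG, normally valine).
    """
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[index:index + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append("M" if index == 0 else amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGCACTG CTTCCGTCTA CGTCTGTCAG"))  # MRTASVYVCQ
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence (after removing spaces) is a quick consistency check.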