Gene P9211_01471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_01471 
Symbolsms 
ID5730820 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp143554 
End bp144939 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content40% 
IMG OID641284491 
ProductDNA repair protein RadA 
Protein accessionYP_001550032 
Protein GI159902688 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1066] Predicted ATP-dependent serine protease 
TIGRFAM ID[TIGR00416] DNA repair protein RadA 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAACGAA CAAGCACTAT TTATATCTGC CAGTCATGTG GGGCAGAAAC AAGTCAATTT 
TTTGGAAGAT GCCCAAGCTG TGGGGAATGG AATTCAATTA TTGAAGAAGC CATATCGCCG
TCGGGATTAA AAGCTAGAAA AGGGAGGTCA ACTCTCTCAA AAATCTCTTC TGGCAGCCGA
TCTGAAACAA TTTCATCAAT CAAAGATACC CCTATCGACA GAATAAAAAG CGGCTATATC
GAATTTGACC GAGTTCTCGG GGGAGGATTA GTTCCTGGAT CTCTTGTACT AATCGGAGGA
GATCCAGGAA TTGGGAAAAG CACATTGCTT TTACAAACAG CAACAGAAAT TGCTCTAAAT
AACAGTGTGC TCTACATAAC AGCTGAAGAA TCAGCAAGTC AAGTTCAACT GCGCTGGCAT
CGTTTAAATA AAGTTGAATC CAATCTTCAT ATTCTTGCTG AAACGGATTT AGAACTAATT
CTTGATGAAT TAGAAAAGCT AAACCCTGAG GTGGCAATTA TAGACAGCAT TCAAGCTTTA
CATGATGCAA CTATTTCAAG CACTCCAGGA TCAATTACCC AAGTAAGAGA ATGTGCAGCA
GCCTTGCAAC AAGTTTCTAA AAAGAAAAAC ATCGCACTAT TAATTGTTGG TCATGTGACA
AAAGAAGGGA TGCTTGCTGG ACCAAAGGTT CTTGAACACC TTGTTGATGC AGTAATGACA
TTTGAAGGTG ATCGTTTCGC AAGTCACAGA CTTCTAAGAG CAGTAAAAAA TAGATTCGGA
GCCACCAATG AACTTGGTGT GTTTGAAATG CAAAGCACGG GTCTTGTAGA AGTGACAAAT
CCCAGTGAAC TATTTCTCAG TTCGGAAGAA ACCTCTGGAG TTGCAACAAT TGTTGCGTGT
GAAGGAACAA GGCCCTTGGC AATAGATATA CAAGCACTCA TTAACCAAAC CACTTACGCA
TCTCCTAGGC GAACAGTGAC TGGTATAGGC AGCAATCGTC TTCATCAAAT ACTGGCTGTT
TTAGAAAAGC ACATAAATTT GGCTCTATCC CGTTTTGATT GCTATTTAGC AGTAGCAGGG
GGGTTAGAAG TAGACGAACC AGCAGCAGAT CTCGGAATTG CAGCCGCAAT TGTCTCAAGT
TTCAAAAACC TAAAAATTCC TAAAAACACA GTTCTTCTAG GCGAAATTGG ACTCGGAGGA
CAGCTTCGGA CTGTTGGCCA AATACCATTG AGGCTAAAAG AGGCAGAAAA ATTAGGTTTT
AAACAAGCAG TTGTCCCTAG CTCTACTGGC ATTGATAAAG AAAACAATGA ACTAACACTT
GAAATATTTG AAGCTTCAAC TATTAGCGAA GCCATACAAA TTATTCTTGA TCTTAAAAGT
CAATAA
 
Protein sequence
MKRTSTIYIC QSCGAETSQF FGRCPSCGEW NSIIEEAISP SGLKARKGRS TLSKISSGSR 
SETISSIKDT PIDRIKSGYI EFDRVLGGGL VPGSLVLIGG DPGIGKSTLL LQTATEIALN
NSVLYITAEE SASQVQLRWH RLNKVESNLH ILAETDLELI LDELEKLNPE VAIIDSIQAL
HDATISSTPG SITQVRECAA ALQQVSKKKN IALLIVGHVT KEGMLAGPKV LEHLVDAVMT
FEGDRFASHR LLRAVKNRFG ATNELGVFEM QSTGLVEVTN PSELFLSSEE TSGVATIVAC
EGTRPLAIDI QALINQTTYA SPRRTVTGIG SNRLHQILAV LEKHINLALS RFDCYLAVAG
GLEVDEPAAD LGIAAAIVSS FKNLKIPKNT VLLGEIGLGG QLRTVGQIPL RLKEAEKLGF
KQAVVPSSTG IDKENNELTL EIFEASTISE AIQIILDLKS Q