Gene NATL1_02051 details

Gene Information

Locus tag: NATL1_02051
Symbol: sms
ID: 4780728
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 190824
End bp: 192203
Gene Length: 1380 bp
Protein Length: 459 aa
Translation table: 11
GC content: 38%
IMG OID: 640083470
Product: DNA repair protein RadA
Protein accession: YP_001014034
Protein GI: 124024918
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1066] Predicted ATP-dependent serine protease
TIGRFAM ID: [TIGR00416] DNA repair protein RadA
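
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 190824
end_bp = 192203

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1380
print(protein_length)  # 459
```

Both results agree with the Gene Length and Protein Length fields.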


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.169271
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.619218
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTCGTT CTGTCTCTAT TTATGTCTGT CAAAGTTGCG GTGCTCAAAC TAGGCAATTC 
TTTGGTCGCT GCAATAATTG TGGAGAATGG AACTCAATAA TAGAAGAAAA AATTAATCAA
AAATCAGATA AATCTATTTA CACAAAGATT AATTCTCCTA AGGAAAAATC TCCCTATCGT
TCAGAACTAA TAAGCCAAAC AAAAAACCAA CTTATTGAAC GCATTTCGAG TGGATATAAG
GAATTAGACA GGGTACTTGG AGGTGGCTTA GTGCCTGGAT CACTTGTATT AATTGGGGGA
GACCCAGGTA TTGGGAAAAG CACTCTTATT TTGCAAAGTG CGACTGAAAT GGCTCATCAC
AGATCAGTCC TTTATGTGGC TGCTGAAGAA TCTGCTCAAC AAGTAAAACT TAGATGGAAT
CGAATTGAGG ATGCTGAGTC CAATCTTCAT TTACTCGCAG AAACAGATCT AGAGATGGTT
ATAAAAGAGC TTGATCATTT GAAACCTGAG GTTGCAGTGA TTGATAGTAT TCAAGCTTTA
CATGATCAAA ATTTATCAAG TTCACCCGGC TCAGTAGCTC AAGTTAGAGA ATGTTCAGCA
GCTTTACAGC AAATTGCTAA ACGACAAAAC ATATCCCTTT TGATCATCGG GCACGTTACA
AAGGATGGAA TGTTAGCCGG ACCCAAAGTT CTTGAGCATC TTGTGGATGC AGTACTTACT
TTTGAGGGCG ATCGATTCGC TTCTCACAGA CTACTAAGAG GTGTGAAAAA TCGCTTTGGT
GCCACTTCTG AGCTTGGAGT CTTTGAAATG CAGGCAGATG GATTATCGGA GGTTCCTAAT
CCAAGTGAAT TATTTTTAAG CAAAACCTCT GCACCTGGGA TTTCAACAAT TGTTACTTGT
GAGGGAACAA GACCATTAGC CATCGATATA CAAGCACTTT TAAATCCCAC GAGTTATGCA
AGTCCAAGAA GAACTACAAC TGGCATTGAG ATAAACAGGC TTCATCAAAT CTTGGCAGTT
CTAGAAAAGA ATATGAATCT TTCGCTCTCA AGATATGACT GTTATCTGGC TGTAGCTGGG
GGATTAGAAG TAGAAGAACC TGGGGCTGAT TTAGGAATAG CTGCTGCAAT AGTTTCGAGC
TTTAAAGATA TTGAGCTTGA AGAAGGTGTA GTTTTTATAG GAGAGATAGG TTTGGCTGGT
CAATTAAGAT TAGTCAGACA AATGCAACAA CGAATTAACG AAGTGATAAG ACTTGGATAT
AAGACATTAA TTATCCCAGA TGGAATAGAC ACAAGTGAGT TTGAAACGAA TCAAAAATTA
AAAATATTAA AAGCTTCTAA TATTAATCAA GCTTTAATAT ATGCTTTAGA TAATAATTAA
 
Protein sequence
MSRSVSIYVC QSCGAQTRQF FGRCNNCGEW NSIIEEKINQ KSDKSIYTKI NSPKEKSPYR 
SELISQTKNQ LIERISSGYK ELDRVLGGGL VPGSLVLIGG DPGIGKSTLI LQSATEMAHH
RSVLYVAAEE SAQQVKLRWN RIEDAESNLH LLAETDLEMV IKELDHLKPE VAVIDSIQAL
HDQNLSSSPG SVAQVRECSA ALQQIAKRQN ISLLIIGHVT KDGMLAGPKV LEHLVDAVLT
FEGDRFASHR LLRGVKNRFG ATSELGVFEM QADGLSEVPN PSELFLSKTS APGISTIVTC
EGTRPLAIDI QALLNPTSYA SPRRTTTGIE INRLHQILAV LEKNMNLSLS RYDCYLAVAG
GLEVEEPGAD LGIAAAIVSS FKDIELEEGV VFIGEIGLAG QLRLVRQMQQ RINEVIRLGY
KTLIIPDGID TSEFETNQKL KILKASNINQ ALIYALDNN
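
The gene sequence begins with GTG, yet the protein sequence begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the permitted start codons and is read as methionine at initiation. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. The function name `translate_cds` is hypothetical, and treating the first codon as M unconditionally is a simplifying assumption rather than a full implementation of table 11.

```python
# Standard codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same mapping as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon.

    Simplifying assumption: the first codon is always the initiator
    and is therefore reported as M, whatever it encodes internally.
    """
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append("M" if pos == 0 else amino_acid)
    return "".join(protein)


# First three 10-nt blocks of the gene sequence above.
first_blocks = "GTGTCTCGTT CTGTCTCTAT TTATGTCTGT"
print(translate_cds(first_blocks))  # MSRSVSIYVC
```

The output matches the first ten residues of the protein sequence above.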