Gene NATL1_14791 details


Gene Information

Locus tag: NATL1_14791
Symbol: srmB
ID: 4780880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 1189779
End bp: 1191548
Gene Length: 1770 bp
Protein Length: 589 aa
Translation table: 11
GC content: 37%
IMG OID: 640084760
Product: putative ATP-dependent RNA helicase
Protein accession: YP_001015301
Protein GI: 124026185
COG category: [J] Translation, ribosomal structure and biogenesis
              [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG0513] Superfamily II DNA and RNA helicases
TIGRFAM ID:
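
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1189779, 1191548)
print(length, protein_length_aa(length))  # prints "1770 589"
```

Both results match the Gene Length and Protein Length fields of this record.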


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.459404
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.230286
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAAATA AAAATCCACA TGAGGATTCC TCATGTCCAG TAGAACAAGA TGTTCTCGAG 
GAATCTACTG AGGAAATATT AGAGGAAAGT GAGCAAAATT CAGAAAATGG ATTTTCTGAA
TTTAATTTTA GTGAAGAACT AATTCAAACA ATCTCTGATA AAGGCTACTC ATCACCTACT
CCGATTCAAA AAGCTGCAAT TCCTGAATTG CTTCTTGGTA GAGATTTAGT TGGACAAGCT
CAAACTGGTA CAGGAAAAAC CGCCGCATTT GCTTTACCAA TACTAGAAAG GTTGAAAAAG
AATGTTGGAC ATCCACAAGT TTTAGTTCTT GCTCCAACCC GCGAGTTGGC TATGCAAGTA
GCTGAATCAT TTCGAACATA TTCGGCAGGT CATCCGCATT TTAAAGTGCT TGCAATATAT
GGAGGCTCTG ATTTCCGAAA TCAAATTAAC ACTCTCAGGA GGGGAGTCGA TGTAGTCGTA
GGGACTCCGG GGCGAGTCAT GGATCACATG CGTCAAAAGA CTCTAAATAC TAGTCATCTG
AGTTGTTTAG TTCTAGACGA GGCTGATGAG ATGTTAAGGA TGGGGTTTAT TGATGACGTT
GAGTGGATTT TAGAACAATT GCCAGAGGAG AGACAACTAG TTTTATTCTC AGCGACAATG
CCATCTGAGA TCAGAAGATT ATCAAAAAAA TATTTAAATA GCCCTGCAGA AATAACTATA
AAAGCAACTG AATTGAAAGA AAGGCTTATA AGGCAAAGGT ATATAAGCGT TCAAAATGTT
TATAAAGTTA ATGCACTTCA AAGAGTACTT GAAGCTGTAT CAGAAGAAGG GGTAATAATA
TTTGCTAGAA CAAAAGCCAT AACAATCGTA GTAGCTGAAA AATTAGAATC ATATGGATAT
AACGTAGCAG TTTTAAATGG AGATATTCCT CAAAATCAAA GGGAACGAAC TGTTGAAAGA
TTAAGACAAG GATCTATCAA TATTTTAGTA GCTACAGATG TCGCAGCAAG AGGTCTTGAT
GTAGATCGAA TTGGTTTAGT AATAAATTAC GATATGCCAT TTGATCGCGA AGCATATGTT
CATAGAATTG GCAGAACAGG TCGTGCAGGA AGAAATGGTG AAGCAATTCT TTTTGTTAAT
CCTAGAGAAA GATCATTTTT AAGTAATCTT GAAAGAGCTG TAGGTCAGCC AATTGAAAAA
ATGGATATTC CAGACAATGA CCTTATAAAC AATAACCGAA TTAAGAAATT ACAAGCAAAA
TTAATAAAAG CCGCCTCAAC AGAGAGAGAC AACCCAGAAG AGGCTAACAT TTTAGAAGAA
TTAATAAAAA ATGTTGAAAA AGAATTAGAT ATAGATCCAA AAGACTTAAC ACTTGCCGCA
TTAAATTTGG CAGTTGGATT TAACGCACTC CTTGAGAATG GCAATGAGGA TTGGATAAGA
CAATCAGCTC AAAGGAATAC AAGAAATGAT CGCAGGGACA ATAATAAATT CAAGCAACGT
CGAAGAGGTG ATTTTGATAA CAACAGACTT GAAGATGAAA TGGACAGATT CAGGGTTGAA
GTTGGGCACC GAGATAGAGT TAAGCCTGGA AATCTAGTTG GGGCAATTGC TAACGAGGCC
GGACTAAAAG GGAGATCGAT AGGAAGAATA AGAATATTTG AAAATTACAG CCTAGTAGAT
CTCCCAAAAC AAATGCCTGA TAAAGTTTTC CAAGCTCTAA AGAAAGTCAA AGTAATGAAT
AGAGAGTTAC AGATAAATAG AGCAGACTAA
 
Protein sequence
MTNKNPHEDS SCPVEQDVLE ESTEEILEES EQNSENGFSE FNFSEELIQT ISDKGYSSPT 
PIQKAAIPEL LLGRDLVGQA QTGTGKTAAF ALPILERLKK NVGHPQVLVL APTRELAMQV
AESFRTYSAG HPHFKVLAIY GGSDFRNQIN TLRRGVDVVV GTPGRVMDHM RQKTLNTSHL
SCLVLDEADE MLRMGFIDDV EWILEQLPEE RQLVLFSATM PSEIRRLSKK YLNSPAEITI
KATELKERLI RQRYISVQNV YKVNALQRVL EAVSEEGVII FARTKAITIV VAEKLESYGY
NVAVLNGDIP QNQRERTVER LRQGSINILV ATDVAARGLD VDRIGLVINY DMPFDREAYV
HRIGRTGRAG RNGEAILFVN PRERSFLSNL ERAVGQPIEK MDIPDNDLIN NNRIKKLQAK
LIKAASTERD NPEEANILEE LIKNVEKELD IDPKDLTLAA LNLAVGFNAL LENGNEDWIR
QSAQRNTRND RRDNNKFKQR RRGDFDNNRL EDEMDRFRVE VGHRDRVKPG NLVGAIANEA
GLKGRSIGRI RIFENYSLVD LPKQMPDKVF QALKKVKVMN RELQINRAD
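
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be cleaned, translated, and summarised. It assumes plain Python with no external libraries; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so a standard codon table is used here.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def clean(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last 30 bases of the gene sequence listed above.
head = clean("ATGACAAATA AAAATCCACA TGAGGATTCC")
tail = clean("AGAGAGTTAC AGATAAATAG AGCAGACTAA")
print(translate(head))  # prints "MTNKNPHEDS"
print(translate(tail))  # prints "RELQINRAD"
```

The two translated fragments agree with the start and end of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the GC content field.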