Gene NATL1_18851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_18851 
SymbolrpoC1 
ID4780050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1544451 
End bp1546358 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content39% 
IMG OID640085174 
ProductDNA-directed RNA polymerase subunit gamma 
Protein accessionYP_001015705 
Protein GI124026590 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02387] DNA-directed RNA polymerase, gamma subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAATA GCAATCTACG TACGGAAAAT CATTTTGATT ATGTAAAAAT AAAATTAGCA 
TCGCCTGAAA GAGTAATGGA GTGGGGGCAA AGGACTTTGC CTAATGGTCA GGTTGTAGGG
GAAGTAACTA AGCCAGAAAC AATAAATTAT CGAACACTCA AGCCTGAAAT GGATGGGTTG
TTTTGTGAAA AGATTTTTGG TCCGTCCAAG GATTGGGAAT GTCATTGTGG GAAATATAAG
AGGGTTAGAC ATCGTGGAAT TGTTTGTGAG AGGTGTGGTG TTGAAGTAAC AGAAAGTCGA
GTTAGGAGAC ATAGAATGGG ATTTATTAAA TTAGCAGCTC CTGTATCACA TGTTTGGTAT
CTAAAAGGTA TTCCTAGCTA TGTAGCTATT TTGTTAGATA TGCCTCTTAG AGATGTTGAG
CAAATCGTCT ACTTTAATTG TTATGTGGTT TTAGATATTG GAGATAGTAA AGATTTAAAA
TATAAGCAGC TATTAACAGA AGATGAATGG CTTGAGATTG AAGACGAGAT TTATGCCGAG
GATTCAACTA TTGAGAATGA ACCCATCGTT GGCATAGGTG CAGAGGCCTT AAAACAATTA
CTTGAAGATT TAAATTTAAA AGAAGTTGCT GAGCAATTAC GAGAAGATAT TGCAACAAGT
AAAGGTCAAA AAAGGGCTAA GCTTATAAAA AGACTAAGGG TAATAGATAA CTTTATTGCT
ACAAGCGCAA GTCCAGAATG GATGGTTTTA GATGCGATAC CTGTAATACC TCCTGATTTG
AGGCCTATGG TGCAACTAGA TGGAGGTAGA TTCGCTACCT CAGATTTAAA TGATTTATAT
AGAAGAGTAA TCAATAGGAA CAACCGTCTT GCTCGTCTTC AGGAAATACT AGCTCCGGAA
ATAATAGTCA GAAATGAAAA AAGGATGCTG CAAGAGGCTG TAGATGCACT TATTGATAAT
GGTAGAAGAG GACGAACTGT TGTTGGCGCA AATAATCGCC CCTTGAAATC ATTAAGCGAT
ATTATCGAGG GTAAGCAAGG TAGATTTAGA CAGAATTTGC TTGGGAAAAG AGTTGATTAT
TCAGGTCGCT CGGTAATAGT AGTTGGCCCC AAGTTGAAAA TGCATCAATG TGGATTGCCA
AAGGAAATGG CGATAGAACT TTTTCAGCCT TTTGTAATTC ATCGATTGAT TCGCCAGAAC
ATTGTCAATA ACATAAAAGC TGCAAAGAAA TTGATACAGA AAGCAGACGA TGAAGTAATG
CAGGTATTAC AGGAAGTCAT AGAGGGTCAT CCTATCCTGC TTAACCGTGC TCCTACTTTG
CACCGATTAG GCATTCAAGC TTTTGAACCT AAATTAGTTG CTGGACGAGC TATACAGCTA
CATCCATTAG TATGTCCAGC TTTTAATGCT GATTTTGATG GAGATCAAAT GGCTGTTCAT
GTGCCTTTGG CAATTGAAGC GCAAACAGAA GCAAGGATGT TAATGCTAGC GAGTAACAAT
ATTCTTTCCC CTGCAACTGG GGATCCAATA GTTACCCCAT CTCAAGATAT GGTTTTAGGG
TCTTACTACC TTACTGCAAT TCAGCCTCAA GCAAAGCAGC CAAAGTTCGG AGATTATTCA
GATACCTATG CATCACTTGA AGATGTTTTA CAAGCTCTTG AAGATAAAAG AATAGATTTA
CATGATTGGG TTTGGGTTCG ATTTTCTGGT GAAATTGAGG ATGACGATGA GCTTCAGAAA
CCGCTTAAGT CAGAGACTTT GAAAGACGGT ACCCGTATTG AGGAATGGAC ATATCGACGT
GACCGATTAG ATGAAGATGG AAGTTTAATT AGCCGTTATA TATTGACTAC TGTTGGCCGT
GTCGTAATGA ACCATACAAT TATTGATGCC GTAGCAGCCA CTTCGTAA
 
Protein sequence
MTNSNLRTEN HFDYVKIKLA SPERVMEWGQ RTLPNGQVVG EVTKPETINY RTLKPEMDGL 
FCEKIFGPSK DWECHCGKYK RVRHRGIVCE RCGVEVTESR VRRHRMGFIK LAAPVSHVWY
LKGIPSYVAI LLDMPLRDVE QIVYFNCYVV LDIGDSKDLK YKQLLTEDEW LEIEDEIYAE
DSTIENEPIV GIGAEALKQL LEDLNLKEVA EQLREDIATS KGQKRAKLIK RLRVIDNFIA
TSASPEWMVL DAIPVIPPDL RPMVQLDGGR FATSDLNDLY RRVINRNNRL ARLQEILAPE
IIVRNEKRML QEAVDALIDN GRRGRTVVGA NNRPLKSLSD IIEGKQGRFR QNLLGKRVDY
SGRSVIVVGP KLKMHQCGLP KEMAIELFQP FVIHRLIRQN IVNNIKAAKK LIQKADDEVM
QVLQEVIEGH PILLNRAPTL HRLGIQAFEP KLVAGRAIQL HPLVCPAFNA DFDGDQMAVH
VPLAIEAQTE ARMLMLASNN ILSPATGDPI VTPSQDMVLG SYYLTAIQPQ AKQPKFGDYS
DTYASLEDVL QALEDKRIDL HDWVWVRFSG EIEDDDELQK PLKSETLKDG TRIEEWTYRR
DRLDEDGSLI SRYILTTVGR VVMNHTIIDA VAATS