Gene Information |
Locus tag | P9303_04371 |
Symbol | rpoC1 |
ID | 4776364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 440316 |
End bp | 442220 |
Gene Length | 1905 bp |
Protein Length | 634 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640085941 |
Product | DNA-directed RNA polymerase subunit gamma |
Protein accession | YP_001016454 |
Protein GI | 124022147 |
COG category | [K] Transcription |
COG ID | [COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit |
TIGRFAM ID | [TIGR02387] DNA-directed RNA polymerase, gamma subunit |
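The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the terminal stop codon while the protein length does not. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 440316
end_bp = 442220
gene_length_bp = 1905
protein_length_aa = 634

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under those assumptions all three checks agree with the values listed in the record.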
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACTAACA GCAACCTCCG TACAGAGAAC CACTTCGATT ACGTCAAGAT CACACTTGCT TCACCAGACC GGGTGATGGA GTGGGGTCAG CGGACACTTC CTAATGGGCA GGTAGTGGGT GAAGTGACGA AGCCGGAGAC GATCAATTAC CGCACCCTTA AGCCTGAGAT GGACGGGCTC TTTTGCGAGA AGATCTTCGG CCCTTCAAAG GATTGGGAGT GTCATTGCGG GAAATATAAG CGTGTTCGAC ACCGTGGCAT TGTCTGTGAG CGCTGTGGCG TGGAGGTCAC TGAAAGTCGG GTCCGGCGTC ATCGCATGGG CTTCATCAAG TTGGCTGCTC CTGTCTCCCA CGTTTGGTAT TTGAAAGGGA TACCTAGTTA TGTGGCGATC TTGTTAGATA TGCCTCTACG CGATGTTGAA CAGATTGTTT ACTTCAACTG TTATGTCGTG CTCGATCCTG GCGATCATAA GGAACTCAAG TATAAGCAGT TACTGACTGA GGATGAGTGG CTAGAGATCG AAGATGAGAT CTATGCGGAA GATTCCACGA TTGAGAATGA GCCGATAGTT GGCATCGGGG CTGAAGCCCT AAAGCAATTG CTAGAAGATT TGAATTTGGC TGAGGTTGCT GAACAGTTGA GAGAGGATAT TTCCTCTAGC AAGGGTCAGA AACGGGCCAA GTTGATTAAG CGTTTGAGAG TGATTGATAA CTTTATTGCC ACTAACGCAA GGCCTGAGTG GATGGTGCTG GATGCAATTC CAGTTATTCC TCCAGATTTG CGCCCCATGG TGCAATTAGA TGGTGGACGT TTCGCTACAT CCGATCTAAA TGATCTCTAC CGTCGGGTGA TTAACCGCAA CAACCGTCTT GCGCGGCTTC AGGAAATCTT GGCTCCTGAG ATTATCGTTC GCAATGAGAA GCGGATGCTC CAAGAGGCAG TTGATGCTCT GATCGATAAT GGTCGTCGTG GTCGAACTGT TGTTGGTGCT AATAATCGCC CGTTGAAGTC ACTTAGTGAC ATCATTGAGG GCAAACAGGG TCGCTTCCGC CAGAACTTGT TAGGTAAGCG GGTCGACTAT TCTGGTCGTT CCGTGATCGT GGTGGGCCCC AAGCTGAAGA TGCATCAGTG CGGCTTGCCT AAAGAGATGG CAATCGAGCT ATTCCAGCCT TTTGTGATCC ATCGCCTGAT TCGTCAAAAC ATTGTCAACA ACATCAAAGC CGCTAAGAAG TTGATCCAGC GAGCTGATGA TGAGGTCATG CAGGTGCTGC AGGAGGTCAT TGAGGGGCAC CCGATCCTTT TAAACCGCGC TCCAACTCTG CACCGTCTGG GTATACAGGC TTTTGAACCG AAGCTGGTTG CTGGCCGTGC CATTCAGCTT CACCCGTTGG TTTGCCCTGC CTTTAACGCC GACTTTGATG GCGACCAAAT GGCTGTTCAT GTGCCTTTGG CGATTGAGGC TCAGACGGAA GCGCGAATGT TGATGTTGGC CAGCAACAAC ATCCTTTCAC CTGCCACTGG CGATCCGATC ATCACGCCTT CCCAGGACAT GGTGCTTGGC TCCTATTACC TAACGGCGAT TAAGCCTGGA GCTTCTGTTC CTGAGTTTGG CGATCAATCC AGAACCTATG CTGGTTTAGA GGACGTCATC CATGCGTTTG AAGATAAAAG ACTCCTACTG CATGACTGGG TTTGGGTTCG CTTCAACGGT GAGGTAGAGG ATGAAGATGA AATTGACAAA CCTCTCAAGG CTGAATCACT TAGTGATGGC ACTCGCATTG AGCAGTGGAC TTATCGCCGA 
GATCGTTTCG ACGAAGACGG AGCATTGATC AGTCGCTACA TCCTTACAAC AGTGGGACGC GTGGTGATGA ATTACACAAT TATCGATGCG GTGGCCGCCG CCTGA
Protein sequence | MTNSNLRTEN HFDYVKITLA SPDRVMEWGQ RTLPNGQVVG EVTKPETINY RTLKPEMDGL FCEKIFGPSK DWECHCGKYK RVRHRGIVCE RCGVEVTESR VRRHRMGFIK LAAPVSHVWY LKGIPSYVAI LLDMPLRDVE QIVYFNCYVV LDPGDHKELK YKQLLTEDEW LEIEDEIYAE DSTIENEPIV GIGAEALKQL LEDLNLAEVA EQLREDISSS KGQKRAKLIK RLRVIDNFIA TNARPEWMVL DAIPVIPPDL RPMVQLDGGR FATSDLNDLY RRVINRNNRL ARLQEILAPE IIVRNEKRML QEAVDALIDN GRRGRTVVGA NNRPLKSLSD IIEGKQGRFR QNLLGKRVDY SGRSVIVVGP KLKMHQCGLP KEMAIELFQP FVIHRLIRQN IVNNIKAAKK LIQRADDEVM QVLQEVIEGH PILLNRAPTL HRLGIQAFEP KLVAGRAIQL HPLVCPAFNA DFDGDQMAVH VPLAIEAQTE ARMLMLASNN ILSPATGDPI ITPSQDMVLG SYYLTAIKPG ASVPEFGDQS RTYAGLEDVI HAFEDKRLLL HDWVWVRFNG EVEDEDEIDK PLKAESLSDG TRIEQWTYRR DRFDEDGALI SRYILTTVGR VVMNYTIIDA VAAA
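The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup plus a GC-content calculation. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database API, and the example input is only the first three blocks of the gene sequence, so its GC value is not expected to match the 50% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, tabs, line breaks) and uppercase the result."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First three 10-base blocks of the gene sequence shown above (not the full gene).
prefix = "ATGACTAACA GCAACCTCCG TACAGAGAAC"

cleaned = clean_sequence(prefix)
print(cleaned)                 # blocks joined into one contiguous string
print(len(cleaned))            # number of bases in the prefix
print(cleaned.startswith("ATG"))  # the CDS begins with an ATG start codon
print(round(gc_percent(prefix), 2))
```

Applying `gc_percent` to the complete gene sequence, after joining any lines it was wrapped across, would give the figure to compare against the GC content field.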