Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_09231 |
Symbol | umuC |
ID | 4717630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 794799 |
End bp | 796085 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 640078636 |
Product | putative UmuC protein |
Protein accession | YP_001009314 |
Protein GI | 123968456 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.986725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATTT CAAATATTGA TGCAATAGCT CTTATAGATG CCAATAATTT TTACGCGTCA TGTGAACAAA ATATTAATCC TCATTTGAGA AATAAACCAG TAGTAATTTT ATCTAATAAT GACGGATGTA TCATTGCAAG AAGCCCTGAA GCGCGAGCTT TAAAAATTAA AATGGGAACT CCGTATTTTA AGGTCAAAGA AAGACTAAAT AAATTAGATG TAGCAGTCTT AAGCTCAAAC TACTCGCTTT ATGGGGATAT GAGCAGAAGA CTAATGAATT TACTGAAAAA TTACTGTGAA CAGATAGAAA TTTATTCTAT TGACGAAGCA TTCGTCTCGA TTTCTAGACC TAATGATGAA AATCTATATC CTTGGGCAAG AAGCATAAGA TCATTAATAT ATCAGAATCT AGGGATTACC ATAACAGTAG GAATAGGAGA AAATAAAGTA AGAGCAAAAA TTGCTAATAA ACTAGCTAAA AATATTGATT ATTCAGCTGG AATATTTGAT TTAGCTAGAA CCGAAAATGA GAATAATTAT TTGAAAAGAA TTAGTATAGA TAAGATATGG GGAGTCGGGA AACAAACCTC TAATTGGTTG CAAAGTAAAG GTATTAAAAA TGCGAGAGAA CTAAGAGATA TGGAAGAAAA TGAAATCATT AAGAAATTAG GCATCGTAGG GAAAAGACTG CAATTAGAAC TGAAAGGCCA TAGATGCCTG CCCATAGAAA AAAACAAGAA ATCAAAAAAA GAAATTCAGG TGAGCAGGAG TTTCGGCACG CCTATCACAA AATTAGAAGA CTTAACTCAA GCACTAGCAA CTCACGCAAT AAAAGCCTCT GAAAAAATGA GAAGTCAGAA TTTGCAATCA TCTGATATTA GAGTATTTGC CAGAACCAGT AAATATTCAA GTCAAAATTA TCAAAGAAGT GCTCATAGAA AACTTACAAA TGCAACAGAT GACACAAATA AAATTTTAAA AATAGTAGTT GAATTATCTG AAGAAATTTA TAATCCCGAA TATAAATTCT CAAAAGCTGG TGTTTTAATG CAGGATTTAA CTAATAGCGA ATATTTACAG CAATCAGTTA TCAATTACAA ATCTCAGAAA GACTTAAAAA AATCAGCAAA TCTTATGAAA ACGATTGATT TATTAAATAA AAGATTTAAT AACAATGCAA TTACATGGGC TATTACAAAA AATCCACAAA GTTGGAAGAT GAATAATAAT TTCTTAAGTC GCTCATCTAC AACTGATATA GAACAAATCC CAACTATAGT GAAGTAA
|
Protein sequence | MRISNIDAIA LIDANNFYAS CEQNINPHLR NKPVVILSNN DGCIIARSPE ARALKIKMGT PYFKVKERLN KLDVAVLSSN YSLYGDMSRR LMNLLKNYCE QIEIYSIDEA FVSISRPNDE NLYPWARSIR SLIYQNLGIT ITVGIGENKV RAKIANKLAK NIDYSAGIFD LARTENENNY LKRISIDKIW GVGKQTSNWL QSKGIKNARE LRDMEENEII KKLGIVGKRL QLELKGHRCL PIEKNKKSKK EIQVSRSFGT PITKLEDLTQ ALATHAIKAS EKMRSQNLQS SDIRVFARTS KYSSQNYQRS AHRKLTNATD DTNKILKIVV ELSEEIYNPE YKFSKAGVLM QDLTNSEYLQ QSVINYKSQK DLKKSANLMK TIDLLNKRFN NNAITWAITK NPQSWKMNNN FLSRSSTTDI EQIPTIVK
|
| |