Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_07071 |
Symbol | umuC |
ID | 5730144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 620283 |
End bp | 621551 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641285070 |
Product | putative UmuC protein |
Protein accession | YP_001550592 |
Protein GI | 159903248 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.109713 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATAG CATTGATTGA TGGCAATAAC TTTTACGCAG CCTGCGAGGA AGCTATTGAC CCTAAACTGA CTGGTCGCCC CTTGGTGGTC CTATCCAACA ATGATGGGTG CGTCATAGCA AGAAATGCCA AAGCTCGTCG TCTTGGGGTC TTAATGGGGG CACCCTATTT CAAAATACGT CATGAACTAA ATAGGCTTGA TGTCGAAGTG CGCAGTTCGA ACTACGCACT ATACGGCGAC ATGAGTCATC GCCTAATGAG CCTACTTACA ATGCATTGCG AGGATTTGGA AATCTATTCA ATTGACGAAG CATTTGCCAA AATCAACCGT CCTCCTGACC AAAGTCTTCA TCCTTGGGCA CGTCAATTAC GAGCATCTAT TTACAAAAGT CTTGGCCTAC CAATTGCAAT TGGCATCGGT GCAAGCAAAA GCCAAGCCAA ACTCGCCAAT TACTTAGCCA AAACAGCCTC CCACCATGCG GGTATCTTTG ACCTAGAGAA GGCTAAAAGT CCAGAAGCAT GTCTTGAAAA CATTGCCATA GAAAATGTTT GGGGTATTGG CCGAAAATTA GCCCGCTGGT GCCGTATACG AGGCATCACA AATGCCAAAC AATTTCTTAA CATGCCAAGT AATGAAGTGA GATCAAAATT TGGGGTTACA GGCATACGTC TACAGAACGA ATTGCAAGGG ATTACTTGTT TACCTCTATC AACTAAACCA GCAGCGAAAC AAGAGACTTG TATTAGTAGA AGCTTTAAAA GGCCGATAAG TACTATCGAA GAATTACGTC AGGCAATATC AACATACGTT GTGAAAGCAA GCGAGAAGCT AAGAATGCAA CAACAACGGG CTGGCGCTAT CACTGTTTTC ACGCGTACAA GTGCCTATAC ACCATATTTT TATAGCCAAG CAGCAACTAA ACGCCTTAGT GTACATAGCA ATGATACATC CATACTGCTC GCAAACTCAC TTGATTTAAC AAAGCGTATA TTCCGCCCTC ATCGTCTACT AGTAAAGGCA GGGGTTATAA TGCAAGACCT CGTAGACAGC GAACATCTAC AACTAAATCT CCTTGAAACA TTTAATCCAG AAAAGACACA CCAACGCGAA CGACTAATGC AAACAATCGA TAATCTCAAC AAACGCTACG GCAATGACAC AATAAAATGG GCTGTATGTG GAACAAACCA AACTTGGAGA ATGCATCGTA ATCATCTAAG TCCGGCTGCA ACAACACGCC TAACAGACAT TCCCACTGTA AAAGTATAA
|
Protein sequence | MSIALIDGNN FYAACEEAID PKLTGRPLVV LSNNDGCVIA RNAKARRLGV LMGAPYFKIR HELNRLDVEV RSSNYALYGD MSHRLMSLLT MHCEDLEIYS IDEAFAKINR PPDQSLHPWA RQLRASIYKS LGLPIAIGIG ASKSQAKLAN YLAKTASHHA GIFDLEKAKS PEACLENIAI ENVWGIGRKL ARWCRIRGIT NAKQFLNMPS NEVRSKFGVT GIRLQNELQG ITCLPLSTKP AAKQETCISR SFKRPISTIE ELRQAISTYV VKASEKLRMQ QQRAGAITVF TRTSAYTPYF YSQAATKRLS VHSNDTSILL ANSLDLTKRI FRPHRLLVKA GVIMQDLVDS EHLQLNLLET FNPEKTHQRE RLMQTIDNLN KRYGNDTIKW AVCGTNQTWR MHRNHLSPAA TTRLTDIPTV KV
|
| |