Gene P9211_07071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_07071 
SymbolumuC 
ID5730144 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp620283 
End bp621551 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content43% 
IMG OID641285070 
Productputative UmuC protein 
Protein accessionYP_001550592 
Protein GI159903248 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.109713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATAG CATTGATTGA TGGCAATAAC TTTTACGCAG CCTGCGAGGA AGCTATTGAC 
CCTAAACTGA CTGGTCGCCC CTTGGTGGTC CTATCCAACA ATGATGGGTG CGTCATAGCA
AGAAATGCCA AAGCTCGTCG TCTTGGGGTC TTAATGGGGG CACCCTATTT CAAAATACGT
CATGAACTAA ATAGGCTTGA TGTCGAAGTG CGCAGTTCGA ACTACGCACT ATACGGCGAC
ATGAGTCATC GCCTAATGAG CCTACTTACA ATGCATTGCG AGGATTTGGA AATCTATTCA
ATTGACGAAG CATTTGCCAA AATCAACCGT CCTCCTGACC AAAGTCTTCA TCCTTGGGCA
CGTCAATTAC GAGCATCTAT TTACAAAAGT CTTGGCCTAC CAATTGCAAT TGGCATCGGT
GCAAGCAAAA GCCAAGCCAA ACTCGCCAAT TACTTAGCCA AAACAGCCTC CCACCATGCG
GGTATCTTTG ACCTAGAGAA GGCTAAAAGT CCAGAAGCAT GTCTTGAAAA CATTGCCATA
GAAAATGTTT GGGGTATTGG CCGAAAATTA GCCCGCTGGT GCCGTATACG AGGCATCACA
AATGCCAAAC AATTTCTTAA CATGCCAAGT AATGAAGTGA GATCAAAATT TGGGGTTACA
GGCATACGTC TACAGAACGA ATTGCAAGGG ATTACTTGTT TACCTCTATC AACTAAACCA
GCAGCGAAAC AAGAGACTTG TATTAGTAGA AGCTTTAAAA GGCCGATAAG TACTATCGAA
GAATTACGTC AGGCAATATC AACATACGTT GTGAAAGCAA GCGAGAAGCT AAGAATGCAA
CAACAACGGG CTGGCGCTAT CACTGTTTTC ACGCGTACAA GTGCCTATAC ACCATATTTT
TATAGCCAAG CAGCAACTAA ACGCCTTAGT GTACATAGCA ATGATACATC CATACTGCTC
GCAAACTCAC TTGATTTAAC AAAGCGTATA TTCCGCCCTC ATCGTCTACT AGTAAAGGCA
GGGGTTATAA TGCAAGACCT CGTAGACAGC GAACATCTAC AACTAAATCT CCTTGAAACA
TTTAATCCAG AAAAGACACA CCAACGCGAA CGACTAATGC AAACAATCGA TAATCTCAAC
AAACGCTACG GCAATGACAC AATAAAATGG GCTGTATGTG GAACAAACCA AACTTGGAGA
ATGCATCGTA ATCATCTAAG TCCGGCTGCA ACAACACGCC TAACAGACAT TCCCACTGTA
AAAGTATAA
 
Protein sequence
MSIALIDGNN FYAACEEAID PKLTGRPLVV LSNNDGCVIA RNAKARRLGV LMGAPYFKIR 
HELNRLDVEV RSSNYALYGD MSHRLMSLLT MHCEDLEIYS IDEAFAKINR PPDQSLHPWA
RQLRASIYKS LGLPIAIGIG ASKSQAKLAN YLAKTASHHA GIFDLEKAKS PEACLENIAI
ENVWGIGRKL ARWCRIRGIT NAKQFLNMPS NEVRSKFGVT GIRLQNELQG ITCLPLSTKP
AAKQETCISR SFKRPISTIE ELRQAISTYV VKASEKLRMQ QQRAGAITVF TRTSAYTPYF
YSQAATKRLS VHSNDTSILL ANSLDLTKRI FRPHRLLVKA GVIMQDLVDS EHLQLNLLET
FNPEKTHQRE RLMQTIDNLN KRYGNDTIKW AVCGTNQTWR MHRNHLSPAA TTRLTDIPTV
KV