Gene Information |
Locus tag | EcSMS35_1965 |
Symbol | umuC |
ID | 6144123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1989260 |
End bp | 1990528 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641616841 |
Product | DNA polymerase V subunit UmuC |
Protein accession | YP_001744017 |
Protein GI | 170683122 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair |
TIGRFAM ID | |
|
|
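The coordinate and length fields above agree with one another, which a little arithmetic confirms. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1989260
end_bp = 1990528

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1269 422
```

Under these assumptions the result matches the listed Gene Length (1269 bp) and Protein Length (422 aa).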
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000119892 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.867639 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTGCCC TCTGTGATGT AAACGCGTTT TATGCCAGCT GTGAGACGGT GTTTCGCCCT GATTTATGGG GTAAACCAGT GGTTGTGCTA TCGAATAATG ACGGTTGCGT TATCGCCCGA AACGCTGAGG CAAAGGCGCT GGGTGTTAAA ATGGGCGATC CCTGGTTCAA ACAAAAAGAT CTGTTTCGTC GCTGTGGCGT GGTTTGCTTT AGCAGCAATT ATGAGCTTTA CGCAGACATG AGCAATCGGG TGATGTCGAC GCTGGAAGAA CTATCGCCCC GCGTCGAGAT TTACAGTATT GATGAGGCAT TCTGCGATCT GACCGGTGTG CGTAATTGTC GCGATCTGAC TGATTTTGGC AGAGAAATTC GCGCAACGGT GCTACAACGT ACCCATCTTA CTGTTGGTGT GGGGATCGCC CAGACCAAAA CGTTGGCTAA GCTTGCCAAT CATGCGGCAA AAAAATGGCA GCGGCAGACG GGTGGGGTGG TGGATTTATC AAATCTGGAA CGCCAGCGTA AATTAATGTC TGCTCTCCCC GTGGATGAAG TCTGGGGGAT TGGACGGCGG ATCAGCAAAA AACTGGATGC GATGGGGATC AAAACCGTTC TCGATTTGGC GGATACAGAT ATCCGGTTTA TCCGTAAACA TTTTAATGTC GTGCTCGAAA GAACGGTGCG TGAACTGCGC GGCGAACCCT GTTTGCAACT GGAAGAGTTT GCACCGACGA AGCAGGAAAT TATCTGTTCC CGTTCGTTTG GTGAACGCAT CACGGATTAT ACGTCGATGC GGCAGGCCAT TTGTAGTTAC GCTGCCCGGG CGGCGGAAAA ACTTCGCAGC GAGCATCAAT ATTGTCGGTT TATCTCCACG TTTATTAAGA CGTCACCATT TGCGCTCAAT GAACCTTATT ACGGCAATAG CGCGTCGGTA AAACTGCTGA CGCCCACTCA GGACAGCAGG GATATCATTA ACGCCGCTAC GCGATCTCTG GATGCCATCT GGCAAGCGGG CCATCGTTAT CAAAAAGCGG GCGTGATGCT GGGGGATTTC TTCAGTCAGG GAGTCGCGCA GCTCAATTTA TTCGATGACA ACGCACCGCG CCCCGGGAGT GAGCAATTGA TGGCGGTAAT GGATACGCTA AATGCTAAAG AGGGCAGAGG AACACTCTAT TTTGCCGGGC AGGGGATCCA GCAACAATGG CAGATGAAGC GTGCCATGCT TTCACCACGT TATACAACGC GAAGTTCTGA TTTACTGCGG GTCAAATAA
Protein sequence | MFALCDVNAF YASCETVFRP DLWGKPVVVL SNNDGCVIAR NAEAKALGVK MGDPWFKQKD LFRRCGVVCF SSNYELYADM SNRVMSTLEE LSPRVEIYSI DEAFCDLTGV RNCRDLTDFG REIRATVLQR THLTVGVGIA QTKTLAKLAN HAAKKWQRQT GGVVDLSNLE RQRKLMSALP VDEVWGIGRR ISKKLDAMGI KTVLDLADTD IRFIRKHFNV VLERTVRELR GEPCLQLEEF APTKQEIICS RSFGERITDY TSMRQAICSY AARAAEKLRS EHQYCRFIST FIKTSPFALN EPYYGNSASV KLLTPTQDSR DIINAATRSL DAIWQAGHRY QKAGVMLGDF FSQGVAQLNL FDDNAPRPGS EQLMAVMDTL NAKEGRGTLY FAGQGIQQQW QMKRAMLSPR YTTRSSDLLR VK
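The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed 5' to 3' on the coding strand (it begins with ATG and ends with TAA, even though the gene lies on the minus strand of the replicon), and it relies on translation table 11 sharing the standard codon-to-amino-acid assignments, differing only in permitted start codons, which are not modelled here. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# uses the same codon-to-amino-acid mapping as the standard code; alternative
# start codons are not handled in this sketch.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def clean(seq):
    """Remove the spaces that split the record's sequences into blocks of 10."""
    return "".join(seq.split()).upper()

def translate(dna):
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(dna):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)

# First three blocks and last nine bases of the gene sequence above.
head = "ATGTTTGCCC TCTGTGATGT AAACGCGTTT"
tail = "GTCAAATAA"
print(translate(head))  # MFALCDVNAF
print(translate(tail))  # VK
```

The two fragments reproduce the first block and the last two residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.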