Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1270 |
Symbol | uvrC |
ID | 6142852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1264810 |
End bp | 1266642 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616148 |
Product | excinuclease ABC subunit C |
Protein accession | YP_001743331 |
Protein GI | 170680797 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | [TIGR00194] excinuclease ABC, C subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000021646 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.000709379 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGTGATC AGTTTGACGC AAAAGCGTTT TTAAAAACCG TAACCAGCCA GCCAGGCGTT TATCGCATGT ACGATGCTGG TGGTACGGTT ATCTATGTCG GCAAAGCGAA AGACCTGAAA AAACGGCTTT CCAGCTATTT CCGTAGCAAC CTCGCTTCGC GCAAAACCGA AGCGCTGGTC GCCCAGATCC AGCAAATTGA TGTAACGGTT ACTCATACAG AAACCGAAGC GCTGTTGCTG GAACACAACT ACATCAAACT CTATCAGCCG CGTTACAACG TTTTGCTACG CGATGATAAA TCATATCCTT TTATCTTCCT GAGTGGCGAT ACCCACCCGC GTCTGGCGAT GCATCGTGGT GCAAAGCATG CCAAAGGTGA ATATTTCGGC CCGTTCCCGA ATGGCTATGC CGTACGTGAA ACACTGGCGC TACTGCAAAA GATTTTCCCC ATTCGCCAGT GCGAAAACAG TGTTTATCGC AATCGCTCGC GTCCGTGTCT GCAATATCAG ATAGGACGTT GTCTGGGGCC GTGCGTTGAA GGACTGGTGA GTGAAGAAGA ATACGCTCAG CAGGTCGAGT ATGTGCGCCT GTTTTTGTCT GGCAAAGATG ATCAGGTGCT TACGCAACTG ATTAGCCGTA TGGAAACTGC CAGCCAGAAT CTGGAGTTTG AAGAAGCGGC ACGTATTCGC GACCAAATTC AGGCGGTGCG ACGCGTCACC GAAAAACAGT TTGTGTCCAA TACCGGCGAC GACCTCGACG TTATTGGTGT GGCGTTCGAT GCGGGCATGG CTTGTGTCCA CGTATTGTTC ATTCGTCAGG GCAAAGTGCT TGGCAGCCGC AGCTATTTCC CGAAAGTGCC TGGCGGTACG GAACTGAGCG AGGTGGTGGA AACCTTCGTA GGACAGTTCT ATTTACAAGG CAGCCAGATG CGCACCTTAC CGGGTGAGAT CCTGCTCGAT TTTAATCTTA GCGATAAAAC GCTGCTCGCC GATTCCCTTT CAGAACTGGC GGGACGCAAG ATTAATGTTC AAACCAAACC TCGCGGCGAT AGGGCGCGTT ATCTGAAACT CGCGCGCACC AATGCGGCGA CGGCCTTAAC CAGCAAACTT TCGCAGCAAT CTACCGTTCA CCAGCGACTG ACCGCGCTTG CCAGCGTGTT GAAATTGCCG GAAGTGAAGC GGATGGAGTG CTTTGACATC AGCCATACCA TGGGTGAACA AACCGTCGCT TCCTGTGTGG TGTTTGATGC TAACGGCCCG CTGCGTGCGG AGTATCGGCG CTATAACATT ACAGGCATCA GGCCGGGCGA TGATTATGCG GCGATGAATC AGGTGCTGCG TCGGCGTTAT GGTAAAGCCA TTGACGACAG TAAGATCCCT GATGTGATCC TTATCGACGG CGGCAAAGGC CAGCTTGCGC AGGCGAAAAA TGTCTTCGCC GAACTGGATG TCTCATGGGA TAAAAATCAT CCGCTGCTAC TTGGCGTTGC CAAAGGAGCA GATCGTAAGG CTGGGCTGGA AACGCTGTTC TTTGAGCCGG AAGGTGAGGG ATTTAGTTTG CCGCCAGATT CTCCCGCGCT GCATGTTATC CAGCATATTC GCGATGAATC ACATGATCAC GCGATTGGCG GGCACCGTAA AAAACGGGCG AAGGTCAAAA ATACCAGTTC CCTGGAAACC ATTGAAGGCG TCGGGCCAAA ACGTCGGCAA ATGTTGTTGA AATATATGGG CGGTTTGCAA GGTTTACGTA ACGCCAGCGT CGAGGAAATT GCAAAAGTGC CGGGTATTTC GCAAGGTCTG GCAGAAAAGA TCTTCTGGTC GTTGAAACAT TGA
|
Protein sequence | MSDQFDAKAF LKTVTSQPGV YRMYDAGGTV IYVGKAKDLK KRLSSYFRSN LASRKTEALV AQIQQIDVTV THTETEALLL EHNYIKLYQP RYNVLLRDDK SYPFIFLSGD THPRLAMHRG AKHAKGEYFG PFPNGYAVRE TLALLQKIFP IRQCENSVYR NRSRPCLQYQ IGRCLGPCVE GLVSEEEYAQ QVEYVRLFLS GKDDQVLTQL ISRMETASQN LEFEEAARIR DQIQAVRRVT EKQFVSNTGD DLDVIGVAFD AGMACVHVLF IRQGKVLGSR SYFPKVPGGT ELSEVVETFV GQFYLQGSQM RTLPGEILLD FNLSDKTLLA DSLSELAGRK INVQTKPRGD RARYLKLART NAATALTSKL SQQSTVHQRL TALASVLKLP EVKRMECFDI SHTMGEQTVA SCVVFDANGP LRAEYRRYNI TGIRPGDDYA AMNQVLRRRY GKAIDDSKIP DVILIDGGKG QLAQAKNVFA ELDVSWDKNH PLLLGVAKGA DRKAGLETLF FEPEGEGFSL PPDSPALHVI QHIRDESHDH AIGGHRKKRA KVKNTSSLET IEGVGPKRRQ MLLKYMGGLQ GLRNASVEEI AKVPGISQGL AEKIFWSLKH
|
| |