Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1040 |
Symbol | uvrC |
ID | 6268709 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | - |
Start bp | 955144 |
End bp | 956976 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641725182 |
Product | excinuclease ABC subunit C |
Protein accession | YP_001879701 |
Protein GI | 187730924 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | [TIGR00194] excinuclease ABC, C subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000000190411 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTGATC AGTTTGACGC AAAAGCGTTT TTAAAAACCG TAACCAGCCA GCCAGGCGTT TATCGCATGT ACGATGCTGG TGGTACGGTT ATCTATGTAG GCAAAGCGAA AGACCTGAAA AAACGGCTTT CCAGCTATTT CCGTAGCAAC CTCGCTTCGC GCAAAACCGA AGCGCTGGTC GCCCAGATTC AGCAAATTGA TGTAACGGTT ACTCATACAG AAACCGAAGC GCTGTTGCTG GAACACAACT ACATCAAACT CTATCAGCCG CGTTACAACG TTTTGCTACG CGATGATAAA TCTTATCCTT TTATCTTCCT GAGTGGCGAT ACCCACCCGC GTCTGGCGAT GCATCGTGGT GCGAAGCATG CCAAAGGTGA ATATTTCGGC CCGTTCCCGA ATGGCTATGC CGTACGTGAA ACTCTGGCGC TACTGCAAAA GATTTTCCCC ATTCGCCAGT GCGAAAATAG TGTTTATCGC AATCGCTCGC GTCCGTGTCT GCAATATCAG ATAGGGCGCT GTCTGGGACC GTGCGTTGAA GGACTGGTGA GTGAAGAAGA ATACGCTCAG CAGGTCGAGT ATGTGCGCCT GTTTTTGTCT GGCAAAGATG ATCAGGTGCT TACGCAACTC ATTAGCCGTA TGGAAACTGC CAGTCAGAAT CTGGAGTTTG AAGAAGCGGC ACGTATTCGC GACCAAATTC AGGCGGTGCG ACGCGTCACC GAAAAACAGT TTGTTTCCAA TACCGGCAAC GACCTTGACG TTATTGGTGT GGCGTTCGAT GCGGGCATGG CTTGTGTCCA CGTATTGTTC ATTCGTCAGG GCAAAGTGCT CGGCAGCCGC AGCTATTTCC CGAAAGTGCC TGGCGGTACG GAACTGAGCG AGGTGGTAGA AACCTTCGTA GGTCAGTTCT ATTTACAAGG CAGCCAGATG CGCACCTTAC CGGGTGAGAT CCTGCTCGAT TTTAATCTTA GCGATAAAAC GCTGCTCGCC GATTCCCTTT CAGAACTGGC GGGACGCAAG ATTAATGTTC AAACCAAACC TCGCGGCGAT AGGGCGCGTT ATCTGAAACT CGCGCGCACC AATGCGGCGA CGGCCTTAAT CAGCAAACTT TCGCAGCAAT CTACCGTTCA CCAGCGACTG ACCGCGCTTG CCAGCGTGTT GAAATTGCCG GAAGTGAAGC GGATGGAGTG CTTTGACATC AGCCATACCA TGGGCGAACA AACCGTCGCT TCCTGTGTGG TGTTTGATGC TAACGGCCCG CTGCGTGCGG AGTATCGGCG CTATAACATT ACAGGCATCA CGCCGGGCGA TGATTATGCG GCGATGAATC AGGTGCTGCG TCGGCGTTAT GGTAAAGCCA TTGACGACAG TAAGATCCCG GATGTGATCC TTATCGACGG CGGCAAAGGC CAGCTTGCGC AGGCGAAAAA TGTCTTCGCC GAACTGGATG TCTCATGGGA TAAAAATCAT CCGCTGCTAC TTGGCGTTGC CAAAGGAGCA GATCGTAAGG CTGGACTGGA AACGCTGTTC TTTGAGCCGG AAGGTGAGGG ATTTAGTTTG CCGCCAGATT CACCCGCGCT GCATGTTATC CAGCATATTC GCGATGAATC ACATGATCAC GCGATTGGCG GGCACCGTAA AAAACGGGCG AAGGTCAAAA ATACCAGTTC CCTGGAAACC ATTGAAGGCG TCGGGCCAAA ACGTCGGCAA ATGTTGTTGA AATATATGGG CGGTTTGCAA GGTTTACGTA ACGCCAGCGT CGAGGAAATT GCAAAAGTGC CGGGTATTTC GCAAGGTCTG GCAGAAAAGA TCTTCTGGTC GTTGAAACAT TGA
|
Protein sequence | MSDQFDAKAF LKTVTSQPGV YRMYDAGGTV IYVGKAKDLK KRLSSYFRSN LASRKTEALV AQIQQIDVTV THTETEALLL EHNYIKLYQP RYNVLLRDDK SYPFIFLSGD THPRLAMHRG AKHAKGEYFG PFPNGYAVRE TLALLQKIFP IRQCENSVYR NRSRPCLQYQ IGRCLGPCVE GLVSEEEYAQ QVEYVRLFLS GKDDQVLTQL ISRMETASQN LEFEEAARIR DQIQAVRRVT EKQFVSNTGN DLDVIGVAFD AGMACVHVLF IRQGKVLGSR SYFPKVPGGT ELSEVVETFV GQFYLQGSQM RTLPGEILLD FNLSDKTLLA DSLSELAGRK INVQTKPRGD RARYLKLART NAATALISKL SQQSTVHQRL TALASVLKLP EVKRMECFDI SHTMGEQTVA SCVVFDANGP LRAEYRRYNI TGITPGDDYA AMNQVLRRRY GKAIDDSKIP DVILIDGGKG QLAQAKNVFA ELDVSWDKNH PLLLGVAKGA DRKAGLETLF FEPEGEGFSL PPDSPALHVI QHIRDESHDH AIGGHRKKRA KVKNTSSLET IEGVGPKRRQ MLLKYMGGLQ GLRNASVEEI AKVPGISQGL AEKIFWSLKH
|
| |