Gene Information |
Locus tag | TM1040_2520 |
Symbol | uvrC |
ID | 4076522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2659370 |
End bp | 2661277 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 638007844 |
Product | excinuclease ABC subunit C |
Protein accession | YP_614514 |
Protein GI | 99082360 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0322] Nuclease subunit of the excinuclease complex |
TIGRFAM ID | [TIGR00194] excinuclease ABC, C subunit |
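The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed 1908 bp) and that the CDS is unspliced and ends in a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2659370, 2661277)
print(length, protein_length(length))  # 1908 635
```

Both values agree with the Gene Length and Protein Length fields in the record.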
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAGA ATCATATGTC AGAGACTATG AACGACATCA GCGCAGAATC GCCCGACCAA CCCGAGCCCC CCCGCACCGG ATACGGTGTC ATTCAGGCAT ACCTGAAGAC GCTTGATTCA TCGCCGGGCG TGTACCGTAT GCTCGATCAC GAAAGCAGGG TGCTCTATGT CGGCAAGGCG CGCAATCTGC GGGCGCGGGT GTCCAATTAC ACCCGCCCGG GACATACGCA GCGCATCGAG ACGATGATCT CACAAACCAG CCGCATGATG TTCCTGACAA CGCGCACCGA AACCGAGGCG CTCTTGTTGG AGCAAAACCT CATCAAGCAG CTGAAGCCCA AATACAATGT GCTGCTGCGC GACGATAAAA GTTTTCCCTA CATCATGGTG AGCAAGAACC ATGCTTTTCC GCAGCTGAAA AAGCACCGCG GCGCACGCAA GGGGAAGGCG AGCTTCTTTG GGCCCTTTGC CAGTGCCGGG GCGGTGAACC GGACGTTGAA TCAGTTGCAA AAAGCGTTCT TGCTGCGCAA CTGCACAGAC ACCATGTTTG AAAACCGCAC CCGGCCCTGC CTGCAGTATC AGATCAAACG CTGCTCTGGC CCATGTGTGG GAAAGATCTC GCAGGCGGAT TACGCCGACA GTGTCCGGGA TGCAGAGCGG TTTCTGGCGG GACGCTCCAC AAAGATCCAG GAAGAGCTTG GCGCTGAGAT GCAAGCCGCC TCGGAAGCGA TGGAATATGA GCGCGCGGCA GCCTTGCGGG ACCGGATCAA GGCGCTGACG CAGGTGCAAT CGGCGCAGGG CATCAACCCG CGTGGCGTGT CCGAGGCTGA CATCATTGGC CTGCATTTGG AAAACGGGCT GGCCTGCGTG CAGGTGTTTT TTATTCGCGC CAATCAGAAC TGGGGCAATC AGGACTTCTA CCCGCGCGTG GCCGAGGATA TGTCCGCCGC CGAAGTCATG GAGGCTTTCA TTGGCCAGTT CTATGACAAC AAGGATGTTC CACGTCAGCT CATCTTGTCG GATGACATCG AAAACGCAGA TCTGATGGCT GTGGCACTCA GTGAGAAAGC CCGGCGCAAG GTGGAAATCG TGGTGCCCCA GCGGGGCGAG AAGACCGAGC TTGTGGCCTC GGCTGTGCGC AATGCCCGTG AAAGCCTCGC TCGCCGGATG TCCGAGAGCG CCACACAGGC CAAACTTTTG CGCGGCATTG CTGATGCTTT TGGGCTGGAA GCTCCGCCAA ACCGCATCGA GGTTTACGAC AACTCTCACA TTCAGGGCAC CAACGCCGTC GGTGGCATGA TCGTCATGGG GCCTGAGGGC TTTATGAAAA ACGCCTATCG TAAGTTCAAC ATCAAGGATG GTGAGGTCAT TGCAGGCGAT GACTTTGGCA TGATGAAGGC GGTGCTGAAC CGCCGCTTCT CCCGCCTGTT GAAAGAAGAC CCCGACCGCC AAAAGGGCAT GTGGCCGGAT CTTCTGCTCA TTGACGGCGG TGCGGGGCAG GTATCGGCCG TGGCCGAGAT CATGGAGGAG CATGGCGTGC AGGACATTCC CATGGTCGGG GTGGCCAAGG GTGTCGATCG CGACCATGGC AAGGAGGAGT TCTACCGCCC CGGCGAAAAC GCCTTTGCGC TGCAACGCAA TGATCCTGTG CTTTACTTCA TTCAACGCAT GCGCGACGAG GCGCACCGGT TTGCCATCGG CACCCACCGG GCCAAGCGGG CAAAATCTCT TGTGGCCAAT CCATTGGATG ACATTCCCGG CGTCGGCGCG CGTCGCAAGA AGGCACTTCT GACGCATTTT 
GGCAGCGCCA AGGCGGTGAG CCGCGCGAAC CTGTCGGATC TCAAGGCGGT GGACGGCGTC TCAGACGCGC TGGCGGAAAC GATCTACAAC TATTTTCAGG TGCGCTGA
|
Protein sequence | MAQNHMSETM NDISAESPDQ PEPPRTGYGV IQAYLKTLDS SPGVYRMLDH ESRVLYVGKA RNLRARVSNY TRPGHTQRIE TMISQTSRMM FLTTRTETEA LLLEQNLIKQ LKPKYNVLLR DDKSFPYIMV SKNHAFPQLK KHRGARKGKA SFFGPFASAG AVNRTLNQLQ KAFLLRNCTD TMFENRTRPC LQYQIKRCSG PCVGKISQAD YADSVRDAER FLAGRSTKIQ EELGAEMQAA SEAMEYERAA ALRDRIKALT QVQSAQGINP RGVSEADIIG LHLENGLACV QVFFIRANQN WGNQDFYPRV AEDMSAAEVM EAFIGQFYDN KDVPRQLILS DDIENADLMA VALSEKARRK VEIVVPQRGE KTELVASAVR NARESLARRM SESATQAKLL RGIADAFGLE APPNRIEVYD NSHIQGTNAV GGMIVMGPEG FMKNAYRKFN IKDGEVIAGD DFGMMKAVLN RRFSRLLKED PDRQKGMWPD LLLIDGGAGQ VSAVAEIMEE HGVQDIPMVG VAKGVDRDHG KEEFYRPGEN AFALQRNDPV LYFIQRMRDE AHRFAIGTHR AKRAKSLVAN PLDDIPGVGA RRKKALLTHF GSAKAVSRAN LSDLKAVDGV SDALAETIYN YFQVR
|
| |
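The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The listed gene sequence begins with ATG and ends with TGA, so it appears to be given in the coding orientation even though the gene lies on the minus strand. The following is a minimal sketch in plain Python, not the pipeline used to produce this record. It uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in the start codons it permits). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, T-C-A-G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-base block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last blocks of the gene sequence, copied from the record.
head = "ATGGCTCAGA ATCATATGTC AGAGACTATG"
tail = "TATTTTCAGG TGCGCTGA"
print(translate(head))  # MAQNHMSETM
print(translate(tail))  # YFQVR
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the GC content field.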