Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1098 |
Symbol | uvrA |
ID | 4077805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 1177570 |
End bp | 1180461 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006402 |
Product | excinuclease ABC subunit A |
Protein accession | YP_613093 |
Protein GI | 99080939 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGAGT TGAAGAACAT TGAGGTGCGC GGCGCGCGCG AGCACAATCT CAAGAACATC GACGTGGACA TCCCGCGGGA TGAACTGGTG GTGATCACGG GCCTCTCCGG TTCCGGCAAG TCGAGCCTTG CCTTTGACAC CATCTACGCC GAGGGGCAGC GCCGCTATGT CGAATCCCTC AGCGCCTATG CGCGCCAGTT CCTCGATATG ATGGAAAAGC CCGATGTGGA TCATATCTCG GGCCTGTCGC CTGCGATCTC CATCGAGCAG AAGACCACAT CGAAGAACCC GCGCTCGACC GTCGGCACCG TCACCGAGAT CTATGATTAT CTGCGACTGC TGTTTGCCCG CGCGGGCACG CCCTACAGCC CTGCCACCGG CCAGCCCATC GAGGCGCAGC AGGTGCAGGA TATGGTCGAC CGGATCATGA CGCTTGAGGA GGGCACGCGG GGCTATCTGC TGGCGCCCAT CGTGCGCGAC CGCAAGGGCG AGTACAAAAA AGAGATGCTG GAGCTCAGGA AACAGGGGTT CCAGCGCGTG AAGGTGGACG GGGAGTTTCA CGATCTTGAT ACGCCGCCCA CGCTCGACAA GAAGTTCCGC CATGACATCG ACGTGGTGGT CGACCGGCTG GTTGTAAAAG AGGGCATCGA GACCCGGCTG GCGGACAGCT TGCGCACTGC GCTGGATCTG GCCGACGGGA TCGCCATTCT GGAGACCGCC CCGCGTGAGG GCGACCCGGA GCGGATCACC TTTTCCGAGA ATTTCGCCTG TCCCGTCAGT GGCTTCACCA TCCCCGAGAT CGAACCACGG CTGTTTTCCT TTAACGCGCC CTTTGGGGCC TGTCCGCATT GTGACGGTCT GGGCGTGGAG CTGTTCTTTG ACGAACGTCT CGTGGTACCG GATCAGAGCC TCAAGGTTTA TGACGGGGCG CTGGCGCCCT GGCGCAAGGG CAAATCGCCC TATTTCCTGC AGACTATCGA GGCCATCGCC AACCACTATG AGTTCGACAA GAACACGCCG TGGAAGGATC TGCCCGCGCA TGTGAAACAG GTGTTCCTGC ATGGCTCCGG CGACGAGGAA ATCGCCTTTC GCTATGACGA AGGCGGGCGC GTCTACAATG TGACGCGCGT CTTTGAGGGC GTGATCCCCA ATATGGAGCG CCGCTACCGC GAAACGGATT CCAACTGGGT GCGCGAGGAG TTCGAGCGCT ACCAGAACAA CCGCGACTGC GGCCATTGTG GTGGGTATCG TCTGCGCGAA GAGGCGCTGG CGGTCAAAAT CGGCCCGGCG GGAGGCCCCG CCGAGCATCG TCTGCATGTG GGGCAGGTGG TGGAGAAATC CATCCGCGAG GCGCTGGCGT GGATCGAAGA GGTGCCGAGC CATCTCAGCC CGCAAAAGCA GGAGATCGCC CGCGCCATCG TCAAGGAAAT CCGCGAGCGT CTTGGGTTCC TCAACAATGT GGGGCTTGAG TATCTGACCC TCTCGCGCAA CGCGGGCACG CTCTCGGGCG GGGAAAGCCA GCGGATTCGT CTGGCGAGCC AGATCGGCTC TGGCCTGACC GGGGTGCTCT ATGTGTTGGA CGAGCCCTCC ATCGGCCTGC ACCAGCGTGA CAATGACCGG CTGATCACCA CGCTCAAGAA CCTGCGCGAT CAGGGCAACA CGGTGATCGT GGTGGAACAT GACGAGGATA TGATCCGGCA GGCCGATTAC GTCTTTGATA TTGGTCCCGG CGCCGGGGTG CACGGCGGGC AGGTTGTCAG CCACGGCACG CCCGCCACGG TTGAGGGTGA TGCGGGTTCG GTCACCGGTC AGTATCTGGC CGGAACGCGT GAGATTGCGG TGCCGGATAC GCGCCGCAAG GGCAACAAGA AGAAGATCAA GGTGGTGAAG GCCTCCGGCA ACAACCTGAA GGACGTCACC GCCGAATTCC CGCTGGGGAA ATTTGTCTGC GTCACCGGTG TGTCGGGCGG TGGCAAGTCC ACGCTGACCA TCGAGACGCT GTTCAAGACT GCCTCGATGC GTCTCAACGG GGCGCGCCAG ACGCCTGCGC CTTGCGAAAC CATCAAGGGG CTCGAGCATC TGGACAAGGT GATCGACATC GACCAGCGCC CCATCGGGCG CACGCCACGC TCGAACCCGG CGACCTATAC CGGGGCCTTC ACGCCGATCC GCGACTGGTT TGCCGGCCTG CCCGAAGCCA AGGCGCGCGG GTATAAACCG GGGCGGTTTT CCTTTAACGT GAAGGGCGGA CGCTGCGAGG CCTGTCAGGG TGACGGGGTG CTGAAGGTCG AGATGCATTT CCTGCCCGAC GTCTATGTCA CCTGCGAGAC CTGTCAGGGC GCGCGCTACA ACCGCGAGAC GCTGGAGATC AAGTTCAAGG GCAAGAGCAT TGCCGATGTG CTGGATATGA CAGTGGAGGA TGCGCAGGAG TTCTTTGCCG CCGTGCCGAC GATCCGCGAC AAGATGGACG CGCTGATGCG GGTGGGTCTT GGCTATATCA AGGTCGGCCA GCAGGCCACC ACACTGTCGG GCGGCGAGGC CCAGCGGGTG AAACTCTCAA AGGAACTCGC CAAACGGTCG ACGGGCCGCA CGCTTTATAT CCTGGATGAG CCGACCACCG GTCTGCATTT TGAGGATGTG CGCAAGCTCT TGGAAGTTCT GCATGAATTG GTTGAGCAGG GCAATTCCGT GGTGGTGATC GAACACAACC TCGACGTGAT CAAGACGGCT GATTGGCTGA TCGACATTGG CCCCGAAGGC GGTGATGGCG GCGGCGAGAT TGTCGCTGTG GGGACGCCGG AAAAGGTCGC CGAAGAGCCG CGCAGTCACA CCGGGCGCTA TCTGAAGCCG ATGCTGGAAG CGCAGGCGCG CAAGAAGGTC GCGGCGGAGT GA
|
Protein sequence | MAELKNIEVR GAREHNLKNI DVDIPRDELV VITGLSGSGK SSLAFDTIYA EGQRRYVESL SAYARQFLDM MEKPDVDHIS GLSPAISIEQ KTTSKNPRST VGTVTEIYDY LRLLFARAGT PYSPATGQPI EAQQVQDMVD RIMTLEEGTR GYLLAPIVRD RKGEYKKEML ELRKQGFQRV KVDGEFHDLD TPPTLDKKFR HDIDVVVDRL VVKEGIETRL ADSLRTALDL ADGIAILETA PREGDPERIT FSENFACPVS GFTIPEIEPR LFSFNAPFGA CPHCDGLGVE LFFDERLVVP DQSLKVYDGA LAPWRKGKSP YFLQTIEAIA NHYEFDKNTP WKDLPAHVKQ VFLHGSGDEE IAFRYDEGGR VYNVTRVFEG VIPNMERRYR ETDSNWVREE FERYQNNRDC GHCGGYRLRE EALAVKIGPA GGPAEHRLHV GQVVEKSIRE ALAWIEEVPS HLSPQKQEIA RAIVKEIRER LGFLNNVGLE YLTLSRNAGT LSGGESQRIR LASQIGSGLT GVLYVLDEPS IGLHQRDNDR LITTLKNLRD QGNTVIVVEH DEDMIRQADY VFDIGPGAGV HGGQVVSHGT PATVEGDAGS VTGQYLAGTR EIAVPDTRRK GNKKKIKVVK ASGNNLKDVT AEFPLGKFVC VTGVSGGGKS TLTIETLFKT ASMRLNGARQ TPAPCETIKG LEHLDKVIDI DQRPIGRTPR SNPATYTGAF TPIRDWFAGL PEAKARGYKP GRFSFNVKGG RCEACQGDGV LKVEMHFLPD VYVTCETCQG ARYNRETLEI KFKGKSIADV LDMTVEDAQE FFAAVPTIRD KMDALMRVGL GYIKVGQQAT TLSGGEAQRV KLSKELAKRS TGRTLYILDE PTTGLHFEDV RKLLEVLHEL VEQGNSVVVI EHNLDVIKTA DWLIDIGPEG GDGGGEIVAV GTPEKVAEEP RSHTGRYLKP MLEAQARKKV AAE
|
| |