Gene Information |
Locus tag | Achl_1820 |
Symbol | uvrA |
ID | 7293280 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 2060186 |
End bp | 2063107 |
Gene Length | 2922 bp |
Protein Length | 973 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643590225 |
Product | excinuclease ABC subunit A |
Protein accession | YP_002487885 |
Protein GI | 220912576 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
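
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the stop codon; neither convention is stated in the record itself. The helper names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2060186
end_bp = 2063107
reported_gene_length = 2922     # bp
reported_protein_length = 973   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons should hold if the assumed conventions match the record.
print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions, the span 2060186 to 2063107 gives 2922 bp, which is 974 codons, or 973 residues once the stop codon is excluded. Both figures agree with the table.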
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.000000000137397 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCTAAAG CCGTAGCCGA AGAAACAGCC GTCCCCGCTT CCTTCACCGC CACCTCCGCC GACACCCCCC AACGCCACGA TCTCTCCCGG CTTGTGGTCA AGGGCGCGCG CGAGCACAAC CTGCGCAACG TGGACCTCGA CCTGCCCCGC GACGCCATGA TCGTCTTCAC CGGACTGTCC GGTTCCGGTA AATCCTCGCT CGCCTTCGAC ACCATCTTCG CCGAAGGCCA GCGACGCTAC GTCGAGTCCC TTTCCGCCTA CGCGCGCCAG TTCCTGGGCC AGGTGGACAA GCCCGACGTC GACTTCATCG AAGGACTGTC GCCGGCGGTC TCCATCGACC AGAAGTCCAC CAGCAAGAAC CCCCGGTCAA CGGTGGGAAC CATCACCGAG ATCTACGACT ACATGCGCCT CCTGTGGGCA CGCGTCGGCC GGCCGCACTG CCCCGTGTGT GGCGAGCCCG TGGCCAAGCA GACGCCGCAG CAGATCGTGG ACCAGCTGCT GGAGCTTGAG GACGGAACCC GCTTCCAGGT CCTCGCACCC GTAGTCCGGG GACGCAAGGG CGAATTCGTC GACCTCTTCA AGGAGCTTTC GGCCAAGGGG TACTCCAGGG CACGGGTTGA CGGCGACCTC ATCCAGCTCA GCGATCCGCC CAAGCTCGGC AAGCAGTACA AGCACACCAT CGAAGTGGTG GTGGACCGCC TGGTGGTCAA GGAAGGCATC AGCCAGCGCC TTACCGACTC CATCGAAACC GCCCTCGGCC TCGCCGAAGG CCGGGTCCTC GCCGAATTCG TCGACCTCGA CGCAGACGAC CCCGGCCGCA CGCGGGCGTT CTCCGAAAAC CTGGCCTGCC CCAACGAACA CCCGCTGGCC ATCGACGAAA TCGAACCACG GTCCTTCTCC TTCAACAACC CGTTCGGAGC CTGCGCTGCC TGCAGCGGCA TCGGCACCAA GCTGGAAGTG GATGACGAAC TGATCGTCCC CAACCCCGAG CTCTCCCTGT CGGAAGGCGC CATCGCGCCA TGGTCCCTGG GAACCGCAAC CACAGAGTAC TGGAACCGGC TCCTGGAGGG CCTGGCCAAG GAACTCGGGT TCTCCATGGA CACCCCCTGG GAAAAACTGG GCAAGGACGT CCGCCAGACG GTCCTGCACG GCAAGGACCA CAAAGTGGTG GTGCAGTACC GCAACCGGTT TGGCCGCGAA CGCAAGTACA GCACCGGCTT CGAAGGCGCC ATCCAGTACG TCCACCGCAA GCACGGCGAA ACCGACTCTG ACTGGGCGCG CGACCGCTAC GAAGAGTACA TGCGCCAGAT TCCCTGCCCT GCCTGCAACG GAGCGCGCCT GAACCCGGCT TCACTGTCGG TCCTGATCAA TGGCAAGTCC ATCGCCGAGG TGGCAGCTCT GCCCATGCGT GAATGCGCGG CGTTCCTGGA CAACCTTGTC CTCACCGGCC GTGAAGCGCA GATTGCCCAC CAGGTGCTCA AGGAGATCCA GGCCAGGCTG ACCTTCCTCC TGGATGTGGG ACTGGAGTAC CTGAACCTGG AGCGCCCGTC CGCCACCCTG TCCGGCGGCG AGGCGCAGCG TATCCGCCTG GCCACCCAGA TCGGTTCCGG CCTGGTGGGT GTCCTCTATG TCTTGGACGA ACCGTCCATC GGTCTTCACC AGCGCGACAA CCGCCGCCTG ATCGACACCC TCACCAGGCT CCGCGACATG GGCAACACCC TGATCGTGGT GGAGCATGAC GAGGACACCA TCCATGTGGC GGACTGGATC GTCGACATCG GACCCGGCGC CGGTGAGCAC 
GGCGGCCAGG TGGTCCACTC GGGTTCCTAT AAGGAGCTCC TCGACAACAC GGACTCCCTG ACCGGCGACT ACCTGTCCGG CCGTAAGAGT ATCGAGATTC CCAAGAAGCG CCGCAAGTAC GACAAGAAGC GTGAACTGAA GGTCGTCGGC GCGCGGGAGA ACAACCTCAC GAACGTGGAC GCAACGTTCC CGCTGGGCCT GCTGACCGCC GTCACGGGCG TCAGTGGCTC CGGCAAGTCC ACGCTCGTCA ACGAAATCCT CTACAAGGTG CTCGCCAACA AGCTCAACGG GGCCAAGCAG GTGGCAGGCC GTCACAAGAC GGTCCAGGGC CTCGAACACC TCGACAAGGT GGTCCACGTT GACCAGAGCC CCATCGGGCG GACACCGCGT TCAAACCCCG CCACCTACAC CGGCGTGTTC GACAACATCC GCAAGCTTTT CGCCGAGACC ACCGAAGCGA AGGTCCGCGG TTACCTGCCC GGCCGGTTCT CCTTCAACGT CAAGGGCGGC CGCTGCGAAG CATGCTCGGG CGACGGCACC CTGAAGATCG AGATGAACTT CCTCCCGGAC GTCTACGTGC CCTGCGAGGT GTGCCATGGC GCCCGGTACA ACCGGGAAAC CCTTGAAGTC CACTACAAGG GCAAGACCAT CGCCGATGTC CTCAACATGC CCATCGAGGA AGGTGCCGAG TTTTTCGCGG CATTTACGCC CATCGCACGG CACCTGAACA CGCTCGTGGA CGTCGGCCTG GGCTACGTCC GCCTCGGTCA GCCCGCCACC ACCCTCTCCG GTGGCGAGGC CCAGCGCGTG AAACTCGCAG CCGAGCTGCA GAAGCGGTCC AACGGCCGCA GCGTCTACGT CCTGGACGAG CCCACCACGG GCCTGCACTT CGAGGACATC CGGAAGCTGC TGCTGGTCCT GCAGGGTCTG GTGGACAAGG GCAACACGGT CATCACCATC GAGCACAACC TTGACGTCAT CAAGAGCGCG GACTGGATCG TTGACCTGGG GCCCGACGGC GGCTCCGGCG GCGGCAAGAT CGTGGCCACG GGAACCCCCG AGCAGGTGGC CACGTCCACC ACCAGCCACA CCGCCGCGTT CCTGGCCGAA ATCCTCAGCT GA
|
Protein sequence | MPKAVAEETA VPASFTATSA DTPQRHDLSR LVVKGAREHN LRNVDLDLPR DAMIVFTGLS GSGKSSLAFD TIFAEGQRRY VESLSAYARQ FLGQVDKPDV DFIEGLSPAV SIDQKSTSKN PRSTVGTITE IYDYMRLLWA RVGRPHCPVC GEPVAKQTPQ QIVDQLLELE DGTRFQVLAP VVRGRKGEFV DLFKELSAKG YSRARVDGDL IQLSDPPKLG KQYKHTIEVV VDRLVVKEGI SQRLTDSIET ALGLAEGRVL AEFVDLDADD PGRTRAFSEN LACPNEHPLA IDEIEPRSFS FNNPFGACAA CSGIGTKLEV DDELIVPNPE LSLSEGAIAP WSLGTATTEY WNRLLEGLAK ELGFSMDTPW EKLGKDVRQT VLHGKDHKVV VQYRNRFGRE RKYSTGFEGA IQYVHRKHGE TDSDWARDRY EEYMRQIPCP ACNGARLNPA SLSVLINGKS IAEVAALPMR ECAAFLDNLV LTGREAQIAH QVLKEIQARL TFLLDVGLEY LNLERPSATL SGGEAQRIRL ATQIGSGLVG VLYVLDEPSI GLHQRDNRRL IDTLTRLRDM GNTLIVVEHD EDTIHVADWI VDIGPGAGEH GGQVVHSGSY KELLDNTDSL TGDYLSGRKS IEIPKKRRKY DKKRELKVVG ARENNLTNVD ATFPLGLLTA VTGVSGSGKS TLVNEILYKV LANKLNGAKQ VAGRHKTVQG LEHLDKVVHV DQSPIGRTPR SNPATYTGVF DNIRKLFAET TEAKVRGYLP GRFSFNVKGG RCEACSGDGT LKIEMNFLPD VYVPCEVCHG ARYNRETLEV HYKGKTIADV LNMPIEEGAE FFAAFTPIAR HLNTLVDVGL GYVRLGQPAT TLSGGEAQRV KLAAELQKRS NGRSVYVLDE PTTGLHFEDI RKLLLVLQGL VDKGNTVITI EHNLDVIKSA DWIVDLGPDG GSGGGKIVAT GTPEQVATST TSHTAAFLAE ILS