Gene Achl_1820 details

Gene Information

Locus tag: Achl_1820
Symbol: uvrA
ID: 7293280
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 2060186
End bp: 2063107
Gene Length: 2922 bp
Protein Length: 973 aa
Translation table: 11
GC content: 65%
IMG OID: 643590225
Product: excinuclease ABC subunit A
Protein accession: YP_002487885
Protein GI: 220912576
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
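
The start, end, gene length, and protein length values above are mutually consistent. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein (the listed gene sequence ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2060186, 2063107)
print(length)                     # 2922
print(protein_length_aa(length))  # 973
```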


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000000137397
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCCTAAAG CCGTAGCCGA AGAAACAGCC GTCCCCGCTT CCTTCACCGC CACCTCCGCC 
GACACCCCCC AACGCCACGA TCTCTCCCGG CTTGTGGTCA AGGGCGCGCG CGAGCACAAC
CTGCGCAACG TGGACCTCGA CCTGCCCCGC GACGCCATGA TCGTCTTCAC CGGACTGTCC
GGTTCCGGTA AATCCTCGCT CGCCTTCGAC ACCATCTTCG CCGAAGGCCA GCGACGCTAC
GTCGAGTCCC TTTCCGCCTA CGCGCGCCAG TTCCTGGGCC AGGTGGACAA GCCCGACGTC
GACTTCATCG AAGGACTGTC GCCGGCGGTC TCCATCGACC AGAAGTCCAC CAGCAAGAAC
CCCCGGTCAA CGGTGGGAAC CATCACCGAG ATCTACGACT ACATGCGCCT CCTGTGGGCA
CGCGTCGGCC GGCCGCACTG CCCCGTGTGT GGCGAGCCCG TGGCCAAGCA GACGCCGCAG
CAGATCGTGG ACCAGCTGCT GGAGCTTGAG GACGGAACCC GCTTCCAGGT CCTCGCACCC
GTAGTCCGGG GACGCAAGGG CGAATTCGTC GACCTCTTCA AGGAGCTTTC GGCCAAGGGG
TACTCCAGGG CACGGGTTGA CGGCGACCTC ATCCAGCTCA GCGATCCGCC CAAGCTCGGC
AAGCAGTACA AGCACACCAT CGAAGTGGTG GTGGACCGCC TGGTGGTCAA GGAAGGCATC
AGCCAGCGCC TTACCGACTC CATCGAAACC GCCCTCGGCC TCGCCGAAGG CCGGGTCCTC
GCCGAATTCG TCGACCTCGA CGCAGACGAC CCCGGCCGCA CGCGGGCGTT CTCCGAAAAC
CTGGCCTGCC CCAACGAACA CCCGCTGGCC ATCGACGAAA TCGAACCACG GTCCTTCTCC
TTCAACAACC CGTTCGGAGC CTGCGCTGCC TGCAGCGGCA TCGGCACCAA GCTGGAAGTG
GATGACGAAC TGATCGTCCC CAACCCCGAG CTCTCCCTGT CGGAAGGCGC CATCGCGCCA
TGGTCCCTGG GAACCGCAAC CACAGAGTAC TGGAACCGGC TCCTGGAGGG CCTGGCCAAG
GAACTCGGGT TCTCCATGGA CACCCCCTGG GAAAAACTGG GCAAGGACGT CCGCCAGACG
GTCCTGCACG GCAAGGACCA CAAAGTGGTG GTGCAGTACC GCAACCGGTT TGGCCGCGAA
CGCAAGTACA GCACCGGCTT CGAAGGCGCC ATCCAGTACG TCCACCGCAA GCACGGCGAA
ACCGACTCTG ACTGGGCGCG CGACCGCTAC GAAGAGTACA TGCGCCAGAT TCCCTGCCCT
GCCTGCAACG GAGCGCGCCT GAACCCGGCT TCACTGTCGG TCCTGATCAA TGGCAAGTCC
ATCGCCGAGG TGGCAGCTCT GCCCATGCGT GAATGCGCGG CGTTCCTGGA CAACCTTGTC
CTCACCGGCC GTGAAGCGCA GATTGCCCAC CAGGTGCTCA AGGAGATCCA GGCCAGGCTG
ACCTTCCTCC TGGATGTGGG ACTGGAGTAC CTGAACCTGG AGCGCCCGTC CGCCACCCTG
TCCGGCGGCG AGGCGCAGCG TATCCGCCTG GCCACCCAGA TCGGTTCCGG CCTGGTGGGT
GTCCTCTATG TCTTGGACGA ACCGTCCATC GGTCTTCACC AGCGCGACAA CCGCCGCCTG
ATCGACACCC TCACCAGGCT CCGCGACATG GGCAACACCC TGATCGTGGT GGAGCATGAC
GAGGACACCA TCCATGTGGC GGACTGGATC GTCGACATCG GACCCGGCGC CGGTGAGCAC
GGCGGCCAGG TGGTCCACTC GGGTTCCTAT AAGGAGCTCC TCGACAACAC GGACTCCCTG
ACCGGCGACT ACCTGTCCGG CCGTAAGAGT ATCGAGATTC CCAAGAAGCG CCGCAAGTAC
GACAAGAAGC GTGAACTGAA GGTCGTCGGC GCGCGGGAGA ACAACCTCAC GAACGTGGAC
GCAACGTTCC CGCTGGGCCT GCTGACCGCC GTCACGGGCG TCAGTGGCTC CGGCAAGTCC
ACGCTCGTCA ACGAAATCCT CTACAAGGTG CTCGCCAACA AGCTCAACGG GGCCAAGCAG
GTGGCAGGCC GTCACAAGAC GGTCCAGGGC CTCGAACACC TCGACAAGGT GGTCCACGTT
GACCAGAGCC CCATCGGGCG GACACCGCGT TCAAACCCCG CCACCTACAC CGGCGTGTTC
GACAACATCC GCAAGCTTTT CGCCGAGACC ACCGAAGCGA AGGTCCGCGG TTACCTGCCC
GGCCGGTTCT CCTTCAACGT CAAGGGCGGC CGCTGCGAAG CATGCTCGGG CGACGGCACC
CTGAAGATCG AGATGAACTT CCTCCCGGAC GTCTACGTGC CCTGCGAGGT GTGCCATGGC
GCCCGGTACA ACCGGGAAAC CCTTGAAGTC CACTACAAGG GCAAGACCAT CGCCGATGTC
CTCAACATGC CCATCGAGGA AGGTGCCGAG TTTTTCGCGG CATTTACGCC CATCGCACGG
CACCTGAACA CGCTCGTGGA CGTCGGCCTG GGCTACGTCC GCCTCGGTCA GCCCGCCACC
ACCCTCTCCG GTGGCGAGGC CCAGCGCGTG AAACTCGCAG CCGAGCTGCA GAAGCGGTCC
AACGGCCGCA GCGTCTACGT CCTGGACGAG CCCACCACGG GCCTGCACTT CGAGGACATC
CGGAAGCTGC TGCTGGTCCT GCAGGGTCTG GTGGACAAGG GCAACACGGT CATCACCATC
GAGCACAACC TTGACGTCAT CAAGAGCGCG GACTGGATCG TTGACCTGGG GCCCGACGGC
GGCTCCGGCG GCGGCAAGAT CGTGGCCACG GGAACCCCCG AGCAGGTGGC CACGTCCACC
ACCAGCCACA CCGCCGCGTT CCTGGCCGAA ATCCTCAGCT GA
 
Protein sequence
MPKAVAEETA VPASFTATSA DTPQRHDLSR LVVKGAREHN LRNVDLDLPR DAMIVFTGLS 
GSGKSSLAFD TIFAEGQRRY VESLSAYARQ FLGQVDKPDV DFIEGLSPAV SIDQKSTSKN
PRSTVGTITE IYDYMRLLWA RVGRPHCPVC GEPVAKQTPQ QIVDQLLELE DGTRFQVLAP
VVRGRKGEFV DLFKELSAKG YSRARVDGDL IQLSDPPKLG KQYKHTIEVV VDRLVVKEGI
SQRLTDSIET ALGLAEGRVL AEFVDLDADD PGRTRAFSEN LACPNEHPLA IDEIEPRSFS
FNNPFGACAA CSGIGTKLEV DDELIVPNPE LSLSEGAIAP WSLGTATTEY WNRLLEGLAK
ELGFSMDTPW EKLGKDVRQT VLHGKDHKVV VQYRNRFGRE RKYSTGFEGA IQYVHRKHGE
TDSDWARDRY EEYMRQIPCP ACNGARLNPA SLSVLINGKS IAEVAALPMR ECAAFLDNLV
LTGREAQIAH QVLKEIQARL TFLLDVGLEY LNLERPSATL SGGEAQRIRL ATQIGSGLVG
VLYVLDEPSI GLHQRDNRRL IDTLTRLRDM GNTLIVVEHD EDTIHVADWI VDIGPGAGEH
GGQVVHSGSY KELLDNTDSL TGDYLSGRKS IEIPKKRRKY DKKRELKVVG ARENNLTNVD
ATFPLGLLTA VTGVSGSGKS TLVNEILYKV LANKLNGAKQ VAGRHKTVQG LEHLDKVVHV
DQSPIGRTPR SNPATYTGVF DNIRKLFAET TEAKVRGYLP GRFSFNVKGG RCEACSGDGT
LKIEMNFLPD VYVPCEVCHG ARYNRETLEV HYKGKTIADV LNMPIEEGAE FFAAFTPIAR
HLNTLVDVGL GYVRLGQPAT TLSGGEAQRV KLAAELQKRS NGRSVYVLDE PTTGLHFEDI
RKLLLVLQGL VDKGNTVITI EHNLDVIKSA DWIVDLGPDG GSGGGKIVAT GTPEQVATST
TSHTAAFLAE ILS
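
Both sequences are listed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind summarized in the GC content field. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def normalize_sequence(text: str) -> str:
    """Join a grouped sequence listing by dropping spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = normalize_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed (grouped in tens).
first_line = "GTGCCTAAAG CCGTAGCCGA AGAAACAGCC GTCCCCGCTT CCTTCACCGC CACCTCCGCC"

cleaned = normalize_sequence(first_line)
print(len(cleaned))  # number of bases on one full display line
print(f"{gc_fraction(first_line):.2%}")
```

Applying `gc_fraction` to the full gene sequence, rather than to a single line, would be the way to compare against the reported GC content.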