Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_4502 |
Symbol | |
ID | 7280441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011881 |
Strand | + |
Start bp | 10608 |
End bp | 13475 |
Gene Length | 2868 bp |
Protein Length | 955 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643580616 |
Product | hypothetical protein |
Protein accession | YP_002478429 |
Protein GI | 219883268 |
COG category | [V] Defense mechanisms |
COG ID | [COG1002] Type II restriction enzyme, methylase subunits |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 183 |
Fosmid unclonability p-value | 0.264892 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGCCG TCAATTTGCT TGGGCTCGTC GAATCGCACT CATACCAGGT GCTTTTCCTG GCACACCTTC GTTGGTCCCG ACCCGATATG CCAGCCCTCG CCGTCGACGT TGACGGAGCG CCTGTCACCG TCACCAATGT CTCCAGCTAC AAAGGGCTAC GTGTCTGGGT TTGTCCAGTC TTGCCGAACG CCTCCCGGCA GGCGGAAATC GACCGCATCA TCGCGAAGAA GAGCACTGAT CGAATCGTGA TTTTCCATAA TGACGAGAAA CAAGTGTGGC GTTGGCCTTC CCGCAACGTT AAGGGCGGCT CCACGAGTAC CCGGCTCACC TCGCATGCAC ATGTTACCGG AACGTCGAAT CCCAAGCTTG TGGAGCGATT GCAGCTAATT ACCCTGGCAA TTACCGAAGA TCTGAACGTC ACTCAAGTCA TTGAACGGGT AAGGCAGGCC TTCGACGTGG AAACGGAGAA GGAGACAAAG CGAGCCTCGA AGCTCATGGC TTCGATGTAT GACACTCTGG CCCAGGCTGA CTGCTCTGAG CACGATATCT CTGTAACGCT CGCGCGCTTG CTCTTTCTAC TGTTCGGGGA CGACACGGAC ATGTGGACCA AGAATCTGTT TCAGGAGTTC CTCATTGAGC ACACCAGCCG GGACGGGTCC GATCTGGCAG AACGTCTGAA TGAGCTTTTT GCTCATCTCG ACACTAGGCC GGCAGACCGA ACCGAAGTAG GTCAACATCT GGCTGGCTTC AAGTACATAA ATGGTGGGCT CTTCAGTGAG CGAATTACCC TTCCCAAGGT CGGACAAGGC CTCCGAACAA CGATCCTCGA CGCCTGCTCC TCAAACTGGT CAGACATCTC TCCCGCTATA TTCGGCTCGA TGTTCCAGTC TGTTAGAGAC GATAAAACGC GTAGGGAATA CGGGGAGCAT TACACGTCCG AAACCGACAT TCTCAAAACG CTCAATCCGC TCTTCCTGGA TGAACTTCGG GATGAGTTTG CACGGGCAGT TGGACATCGT GAAGAGTACA GCCGTCTGCT AAAGCTTCGT GAAAGACTGG GAAGGATCCG GTTCCTCGAC CCAGCCTGCG GCTGCGGCAA CTTCATCATC ATCGCTTATC GGGAGATGCG GCTGCTGGAG ATTTCGGTTC TCGAGCGCAT TAGAGTCAAA GAGTCAGAGC AGCTCGTCTC GTTGGCCGCT AACCAACTCG AGTTTGGCCT CGAAGGTGAA CTGGACACAG CAGGGGGAAG CAGGACCACG TTGCTTGACC CAGTGGTCCG CCTCGACAAT TTCTTTGGCA TTGAGATTGA CGAGTGGCCA GCCAAGATAG CGGAAACAGC TATGTTCCTT ATTGACCGGC AATGCGACCT TCAGATGCTG GAAAGGTTGG GTTTCGCCCC CGAACGACTA CCCATCCAGC GCCAAGCCTC AATCATCTCG GCGACTCCCG AAAATCCTTC CGGAGGAAAC GCCTTGAAAC TCGATTGGAA GAAAATCATA GCTCCGGGTC CTGACAGCAT CATTGCGGGC AACCCACCTT TCATCGGAAT GGCCTGGATG GACAAGGCTC AGCAGAACGA CAACCGGCAA ATTTTCGCGA TGTTGCCTGA AGCCCACGGA GAGCGGACGG GACGACTTGA TTATGTTGCT TGCTGGTATG CAAAAGCCAT CGACTACCTG CGTGGAAGCC AAGCGCGAGT TGCCTTCGTC TCGACAAACT CGATCACCCA AGGGGATCAG GCTCGAGCAC TTGATCCCAT CCTTAAACGC GCTGGTGTCC ACATCGACTT TGCGCATCGC ACATTTAAAT GGCAATCAGA GGCTACCGGT GGAGCCGCCG TGCACTGCGT GATTATCGGC ATGTCAGCCA CTGGCCGGCC GCAGCGAGTC CTATTCGACT ACCCGCAACT ATCCGGAAGC CCCGTCGCTA GACCCGTCAC GGGTATAAAC ATGTATCTTT TGGATTCGGA CCTGCCTGCG CCGGTGAAGC GCCGAGTTCC CTTCTATGCA CACTTGCCAA AGATGACAAA AGGATCCCAG GCAACAGACG GTGGCCACCT GATCGTTGAG AACGCTGAGG ATTTGGCGCC CCTGAAGGCA GATCCCGTAG CCCGGAAGTA TCTTCGTCCG TTCATGCAGG GCAGAGACAT GCTGAACAGC ACTGAGAAGT GGTGTCTATG GTTTGAAGGT GCCCAGCTTG CCGATCTTCA ACAGGTACCA GCCATAGCCA CAAGACTTGA AGGGGTAGCC AACGCAAGGC TGCAGAGCCC GACTGCCTCT GTCCAGAAGC AGGCGGCCAT CCCACATCTC TTCACCCAGA GAAGACAGCC TAAAAACAGG TACCTCGCGT TGCCGGAAGT CAGTAGCGAG GACCGGGAGT ATGTCCCGAT GACATATCTA GAGGCTGACG TGATCGCGGG CAACAAGCTG ATTCACGTGG ACGCCTGCCC GGAATGGCTC TTCGGCGTAC TCCAATCCCG AATATTCATG GTTTGGCTGC GAACGTTTGC CGGGCGGTTG AAGAGCGACC TCAGCATCTC ACCTGACTTG GCATACTGCG CGTTCCCGTT CCCTGACCTT TCCGCAGACC AGACGGAAGT AATCTCCAGC TTGGCGGAAA GGATCGTAGA AACAAGACTT GAACTGGGAA AACCGCTTTC TGTTCTCTAC AAACGTGGAC AAACACCAAG CCAACTCGTC AGCTTGCATG ACAAGCTGGA CGCTGCCGTG GCCGATGCCT TTGAGCTCCA TGTAGAAGAC GAACTCGAAG TGGCCAGCGA GCTGCTGAAA CGACATCACA TCCTCGTAGG AACAGTAAAA GCGCGGGAAG TTTCACCCCC ACTGCTGGGC GCAGCGGCGA ACGCATGA
|
Protein sequence | MMAVNLLGLV ESHSYQVLFL AHLRWSRPDM PALAVDVDGA PVTVTNVSSY KGLRVWVCPV LPNASRQAEI DRIIAKKSTD RIVIFHNDEK QVWRWPSRNV KGGSTSTRLT SHAHVTGTSN PKLVERLQLI TLAITEDLNV TQVIERVRQA FDVETEKETK RASKLMASMY DTLAQADCSE HDISVTLARL LFLLFGDDTD MWTKNLFQEF LIEHTSRDGS DLAERLNELF AHLDTRPADR TEVGQHLAGF KYINGGLFSE RITLPKVGQG LRTTILDACS SNWSDISPAI FGSMFQSVRD DKTRREYGEH YTSETDILKT LNPLFLDELR DEFARAVGHR EEYSRLLKLR ERLGRIRFLD PACGCGNFII IAYREMRLLE ISVLERIRVK ESEQLVSLAA NQLEFGLEGE LDTAGGSRTT LLDPVVRLDN FFGIEIDEWP AKIAETAMFL IDRQCDLQML ERLGFAPERL PIQRQASIIS ATPENPSGGN ALKLDWKKII APGPDSIIAG NPPFIGMAWM DKAQQNDNRQ IFAMLPEAHG ERTGRLDYVA CWYAKAIDYL RGSQARVAFV STNSITQGDQ ARALDPILKR AGVHIDFAHR TFKWQSEATG GAAVHCVIIG MSATGRPQRV LFDYPQLSGS PVARPVTGIN MYLLDSDLPA PVKRRVPFYA HLPKMTKGSQ ATDGGHLIVE NAEDLAPLKA DPVARKYLRP FMQGRDMLNS TEKWCLWFEG AQLADLQQVP AIATRLEGVA NARLQSPTAS VQKQAAIPHL FTQRRQPKNR YLALPEVSSE DREYVPMTYL EADVIAGNKL IHVDACPEWL FGVLQSRIFM VWLRTFAGRL KSDLSISPDL AYCAFPFPDL SADQTEVISS LAERIVETRL ELGKPLSVLY KRGQTPSQLV SLHDKLDAAV ADAFELHVED ELEVASELLK RHHILVGTVK AREVSPPLLG AAANA
|
| |