Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3427 |
Symbol | |
ID | 7294908 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3798175 |
End bp | 3799875 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643591834 |
Product | HNH endonuclease |
Protein accession | YP_002489473 |
Protein GI | 220914164 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGGGTC TTCAGGCCTC TGTCGCCGGC CTTGACGCTC TTTTCCTGGA GGACGCGGAC CTGGACCGGG CTGACCCTGA CGCGGAGGGC GAGGCTGCGG TGGTTGACCT GTTGAAGCGG AAGTCGGAGG TTCGGCTGCA GCGGCTGGCG TTGTGGAAGC AGTTGGCGGC GCAGGCCGCG GCCGGAATGG CAGCGGATGC GGCGGAGTTC GCGGAGTTTC AGGAGGCGAT GACGCCGCCG GAGGTCACAG GGTCGGAGGG GGCGTTCGTG GAGATGTCCA CGACGGCGGA GATCGCCGGC GTCCTGACGC TCAGCCCGGG CGCCGCGTCA GCCTTCATCA GCCAGTCACG GAAAGTCTGC GCGATGCCAC CGGTAGCGGC CGCGCTGTCC GCTGGGTTGA TGTCGTGGCG GCACGCGGTG ATCGTGGCCG ACGAAGCCGA CTGCCTCGCC CCCGAGAGTG CCGAGGCGCT GGTGACGCAT TTCTTCGACC CCGACGCACC CAACCGAGCG CGCGGGTCGG CGCCCGGTGA CCTCGTGCCG CACCTGTTCC GCCGGAAAGT ACGGACCTGG CGCGAACGCA CCTACCCTCA GACAGTCCAG GAACGGCACG CGAAGTGTGT GGCGGACCGG CGGATGGAAT ACCGGCCCGA TGCCGACGGG ATGGCTTCGA TTACCCTGAT CCTGTCCGGG GACACCGCCT GCGCCATCTG GAACAAAACC ACCGCCATCG CCCGCGGCCT GCAGGGACCC GGTGAGACCC GCACCCTCAC CCAACTCCGC CCCGACACCG CCGCCGCGCT CCTGCTCGGT GCCCACACCG GGACCGCTGC CGCAGGTCGG TTGGACGGTG GCGAAGGATC CGGCGGCGTA TCCGTCGTCG ATAGTCTTCC GGGTGCCAGT GGCAACGGGA GCGACCCTTA CGCGATTGAT CTCAGCAAGG TCCCCGCACC GAAAGCCGAC GTCCTGGTTA CCATCCCGCT ATTTACACTG CTGGGGGCCA CCGACGAACC TGCCGACCTT GACGGGTACG GCCCCATCCC CGCAGCGATG GCCCGGAAAC TCGTTGCCGA CGGGGCTGCC TCGTTTTACC GGGTCCTCGT CGACCCGCGA GACGGCGCAC CGCTCGAGAT CGGACGGACC AGCTACCGGC TGTCGGAGGC GATGAAACGC TGGATCAGGA TGCGCGACGG ACACTGCACG TTCCCCGGCT GCACCAACCC CAGCACAGAC AACGACACCG ACCACCTCAC CGCCTGGCAG CACGACGGGA CAACCGGGGT GAGCAACCTG GCACAGCTCT GCCCGAAACA TCACCGCCTC AAACACAACA GCGGCTGGAC CCCCACACCA GCCAGCACAA CCAAACCACC CGGCTGGACC TCACCCACAG GACGGCACTA CCCTGGCCAA CACCCCGACC CGCGACCACC ACACTTGCCA CCCGGGCTGC TTGAACAGAA ACAACCCGTG ACCGGGGACA TTGGCGCGGC CGAACCGCCG CACGCCGCCT CACACACCGC AGCGGACAGA CCCCTGAAAT CTGTCAGGGA CCGTGAGCCG GCGAGCGACC ATGCGCCAGC CAGAGACGTT GTTGCTGCGG GCGAGTCTGG TGCGACCGGA GAACTGGGCG TGCCGGCAAT CAACCGGCCG GATGAACTAA GCCCGCTCGA ACGCACACTC ACGGACCATC TCGCCGCCTG A
|
Protein sequence | MEGLQASVAG LDALFLEDAD LDRADPDAEG EAAVVDLLKR KSEVRLQRLA LWKQLAAQAA AGMAADAAEF AEFQEAMTPP EVTGSEGAFV EMSTTAEIAG VLTLSPGAAS AFISQSRKVC AMPPVAAALS AGLMSWRHAV IVADEADCLA PESAEALVTH FFDPDAPNRA RGSAPGDLVP HLFRRKVRTW RERTYPQTVQ ERHAKCVADR RMEYRPDADG MASITLILSG DTACAIWNKT TAIARGLQGP GETRTLTQLR PDTAAALLLG AHTGTAAAGR LDGGEGSGGV SVVDSLPGAS GNGSDPYAID LSKVPAPKAD VLVTIPLFTL LGATDEPADL DGYGPIPAAM ARKLVADGAA SFYRVLVDPR DGAPLEIGRT SYRLSEAMKR WIRMRDGHCT FPGCTNPSTD NDTDHLTAWQ HDGTTGVSNL AQLCPKHHRL KHNSGWTPTP ASTTKPPGWT SPTGRHYPGQ HPDPRPPHLP PGLLEQKQPV TGDIGAAEPP HAASHTAADR PLKSVRDREP ASDHAPARDV VAAGESGATG ELGVPAINRP DELSPLERTL TDHLAA
|
| |