Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_1089 |
Symbol | |
ID | 7292534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 1194985 |
End bp | 1198140 |
Gene Length | 3156 bp |
Protein Length | 1051 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643589496 |
Product | type III restriction protein res subunit |
Protein accession | YP_002487171 |
Protein GI | 220911862 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG1061] DNA or RNA helicases of superfamily II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCAACT TCATTGGGGG AGAGGCGGCA GCCAAGCTGC CTGAAGGCCT GTACGAATTA CTTAGCACGG ACGCGCTCGG CGATTTGCTG AATAACGAGT CGGAACTGCA GCCTACCTTC GCGGACATAG AAGACGAAGA CGCTCCGGAC GTCCTATCCC GCCACGTTGC TGACGCCGTC CGCGAAGCCC TCACGGCGGC CAAGCCCGCA GACAGGGTCG CCCTGGCAAA TCGGCTCCTC CAGGACCTGA ACACGCCGGA CCGCATCGCA GAAGGTCCCA CTCGACTCCA GTCCCTGCAT CGCCCGGACG CGCTCAAGCG CCGACAACTT CGTCGCCCCA CCACCAAGCT CAGTGACTCA GCGCTGCTGA CTAACAGCAA GGACGAGCCC AATCTCGCCG CCGAGCTCCG TGCCGAGATC GAGTCCGCCA ATACGGTGGA CCTTCTCTGC GCCTTTGTCC GCTGGACGGG CCTCAGGCTC CTGGAGCCTG CCTTAGAGCA GCTGAAAGAA CGCGGAGCAA GGTTACGCGT CATCACCACC ACATATATGG GCGCCACGGA ACGCCGCGCC ATTGACGAGC TCGTCAACCG ATACGGGGCA GAGGTAATGA TCAGCTACGA AACGCAGGCA ACTCGATTGC ATGCCAAGGC CTGGCTGTTC CGCCGTAAAA CGGGATTCGA CACCGCTTAC GTTGGCAGCT CGAATCTGAG CCAGGCTGCG CTCCTGGACG GGCTGGAATG GAATGTCCGC CTCAGTTCCG TCGCAACGCC CGCCCTCCTG CAGAAGTTCG AGGTCACCTT CGACAGCTAT TGGGAGCAGC GTGCCTTTCA AAGCTACGAT CCGGAACGCG ACGGGGAGAA GCTCGACGCT GCGCTGGAGC GCAACGGTGG CCGCCGCACC GTATCCCCGG ATGCCGCGAC AGGGCTTGAG GTTCAGCCAT TCCTTCATCA GGAGGAGATG CTGGAAGACC TCGAAGCGGA GCGCCTGAAA GGCCATAACC ACAACCTCCT GGTCGCAGCC ACCGGAACCG GAAAGACGGT CATTGCGGCT CTGGACTACA AACGGCTGTC CGAAGCTGCT GGCAGAGATC TAAAGCTGCT CTTCGTCGCC CACCGGCAAG AAATCTTGAA ACAAGCCATG CGCACCTATC GCGACGTCAT GCAGGACGGC GCCTTCGGTG AGCTATACGT AGGGGAGCAT AAGCCACAAG AGTGGAAGCA CATCTTCGCC TCCGTCCAGT CGCTGTCCTC ACTCGGCATC GAGCAGTTGG AGCCTGACTT CTTCGACGTC GTCGTCATCG ATGAGTTCCA CCACGCCATG GCGCCCACGT ACCGCCGCCT GCTGGACCAT CTGAAACCGC AGCAGCTCCT CGGACTCACG GCGACCCCGG AACGGGGTGA CGGCGTCGAC GTCGCCAAGC AATTCTTCGA TGGCCGGACA GCCAGTGAGC TTAGGCTCTG GGACGCTCTG GACGCTGACC TGCTGGTCCC GTTCCACTAC TTCGGCGTCT CCGATGACGT CGACCTGAGC CAGTTGGAGT GGAAACGCGG CAACTACGAC ACCACCCAGC TGAGCGCCCT CTACACAGGG AATGACGCCC GGGCCGCCAA GGTGATCCGT GAGCTCCGCG ACAAGGTCAC CAGCACCAAC CATATGCGGG CCATCGGCTT CTGCGTCTCG GTGCAGCACG CCCACTACAT GGCCGAGGTG TTCAACCGGG CAGGCATTGC CTCCGTCGCC GTCGATGGCA CCACTGACAA TGTTGACCGC GAGGAATCCC TCAGGCGTCT GGGGCAGCGG GAGATCAACT GCATCTTCGC CGTCGACCTT TTCAATGAAG GGCTGGACCT GCCGCAGGTG GACACTATCC TGCTGCTCCG GCCCACGCAG AGCGCCACGA TCTTCCTCCA GCAGCTGGGA CGCGGGCTGC GCCGTGCCGA AGGCAAAGCG GTGCTGACAG TCCTGGACTT CATCGGCCAG CAGCGCCGCG AGTTCCGCTT TGATCTGCGC TACCGGGCGC TGACGGGCTA CGGGCGCAAG GAGCTGGAGA AGGCCGTCGA GGACGAGTTC CCCTACCTGC TGTCCGGTTC GCAGATCATG CTGGACCGGG TGGCGCAGAA GGTGGTTCTG GACAACATCA AGGCGCAGCT GCGGTTCAAC CGTGCACAGC TGGTCCGGGA CATCGCCTCG TACGCCGAAA CCGAGCTGGA GGCCTATCTG GAGCGGTCGG GGAATGACGT GAAGACGATT TACCGCTCCA CCAAGGACTC GTGGACCGGC TACCTCCGGC AGGCAGGACT TATCGAGGGG CTCTCACCCC TGGAGTCCGT GCTGCGGGGG AAGCTCGAAG AGCTGTCGGA CGCGGCGGAA AAGAAGCTGC TGGGCCGCAT GGCCGCGCTG ATCCATGTGG ACGATCCGGA ACGCGTCGCT GCTTATTCGA TGCTGGTTGC TCCCGACGCG CCCCGCTACG CGGATCTTGG GATGCGTGAG CAGACTTTTG CACGCATGCT TTTCTTCACG CTGTGGGATG ACGGAGGCGG GTTCAGGACG TACGACGACG GACTGGACCA CCTGCGCGGC TACCAGTTTG TGTGCCGCGA GATCCGCCAG GTGGTGAAGA TGGGAGTGGC TGCATCGAAA CATGCAGCGA AGAACCTTGG CGCAGGCCTG CAGCACATTC CCCTGCTGTC CCACGCTACC TACCGGCGCG AGGAAGTTTT GGCGGCGCTG CAGTACGGTT CCCTGGAGCA GGGCAAGGAC GTGCAGCACC GAGAGGGTGT GGCTTGGTGT CCGGCGACGT CCACCGACGC CTTCTTTGTC ACCCTGAATA AGGATGACAA GAAGCACTCG GCAACCACGA TGTACAAGGA CTACGCCATC AGCCCCGAGC TGTTCCACTG GGAGTCGCAG AACGCTACCT CGCCTGGCAG CCCGACGGGG CGCCGATATC TGGACCGGAC ATCGCATGGG TCAAAGATTC TGATCTTCAC GCGCGATACA GCCGACGACG AGACCCGGCT GACAGTTCCG TACACCTGCC TGGGACAAGT GGACTACGTT CAGCACTCCG GCGAGAGGCC GATCGCTATC ACCTGGAAGC TGCACCGGCC GATGCCCGCG GATGTGTTCG CTACGGCTGC TGCAGTGGCG CAATAG
|
Protein sequence | MTNFIGGEAA AKLPEGLYEL LSTDALGDLL NNESELQPTF ADIEDEDAPD VLSRHVADAV REALTAAKPA DRVALANRLL QDLNTPDRIA EGPTRLQSLH RPDALKRRQL RRPTTKLSDS ALLTNSKDEP NLAAELRAEI ESANTVDLLC AFVRWTGLRL LEPALEQLKE RGARLRVITT TYMGATERRA IDELVNRYGA EVMISYETQA TRLHAKAWLF RRKTGFDTAY VGSSNLSQAA LLDGLEWNVR LSSVATPALL QKFEVTFDSY WEQRAFQSYD PERDGEKLDA ALERNGGRRT VSPDAATGLE VQPFLHQEEM LEDLEAERLK GHNHNLLVAA TGTGKTVIAA LDYKRLSEAA GRDLKLLFVA HRQEILKQAM RTYRDVMQDG AFGELYVGEH KPQEWKHIFA SVQSLSSLGI EQLEPDFFDV VVIDEFHHAM APTYRRLLDH LKPQQLLGLT ATPERGDGVD VAKQFFDGRT ASELRLWDAL DADLLVPFHY FGVSDDVDLS QLEWKRGNYD TTQLSALYTG NDARAAKVIR ELRDKVTSTN HMRAIGFCVS VQHAHYMAEV FNRAGIASVA VDGTTDNVDR EESLRRLGQR EINCIFAVDL FNEGLDLPQV DTILLLRPTQ SATIFLQQLG RGLRRAEGKA VLTVLDFIGQ QRREFRFDLR YRALTGYGRK ELEKAVEDEF PYLLSGSQIM LDRVAQKVVL DNIKAQLRFN RAQLVRDIAS YAETELEAYL ERSGNDVKTI YRSTKDSWTG YLRQAGLIEG LSPLESVLRG KLEELSDAAE KKLLGRMAAL IHVDDPERVA AYSMLVAPDA PRYADLGMRE QTFARMLFFT LWDDGGGFRT YDDGLDHLRG YQFVCREIRQ VVKMGVAASK HAAKNLGAGL QHIPLLSHAT YRREEVLAAL QYGSLEQGKD VQHREGVAWC PATSTDAFFV TLNKDDKKHS ATTMYKDYAI SPELFHWESQ NATSPGSPTG RRYLDRTSHG SKILIFTRDT ADDETRLTVP YTCLGQVDYV QHSGERPIAI TWKLHRPMPA DVFATAAAVA Q
|
| |