Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3535 |
Symbol | |
ID | 7295016 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | + |
Start bp | 3917265 |
End bp | 3920567 |
Gene Length | 3303 bp |
Protein Length | 1100 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643591941 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_002489580 |
Protein GI | 220914271 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGACG ATGCCACGCC TGCCCTCCCG TCAGCTGCCG ATGCTTTGTT TCCCGGATTC GCGGACCCAC CCGTCACGGC CCGCCCCCGG GTGTGGTGGC ACTGGATGGA CGGAAACATC GACCCCGCTG GGATCGTCAA GGACCTTGAG TGGCTTTCCT CCTCCGGCGC AGGTGGAGTC CAGGCCTTCA CCGGTTCCAT GGGCATCCCC CAGTACACCC GGGAGCGCGT CTCTTTCCGT TCCCCGGCAT GGCAAACGGC CATCTCCTGC GCGGCGTCCA CTTCCACCAG GCTGGGACTG GAACTGGCCG TGGCCACCTC CGCCGGTTGG AGCGCGACGG GCGGGCCTTG GGTGGCACCC CACGCCGGAA TGAAGAAGCT CGTTTGGAGC ACCACGGCGG TGACCGCCGG CCAGCCGGCA CTGCCCCGGT TGGCCGTACC GCCGTCGGTA TCCGGCCCTT TCCAGGACGT CCCCTTCGGA GCGATCCGCA ATGATCCCGT TGGCGTCCCG GAGTTTTACG ACGACGTGGC GGTACTGGCA TTGCCCCGGC GCGGCGGACA CTTCCAGCTG ACGCCTTCAC GGGTGGTTGC CAGTGGTACC ACTTCCGGGG ACCGGCATCC TGAATCCTTG GCCGACGGCA CGTTCTGGCC GACGGCGGAG GTTGTGGCAG CCTCCGGCAC TACATGGATA ACGGCAGAAT TCGATCAGCC AACGCACGTT TCCTCGGTGC GTGTGGGGCT GCCGGCCTCG CGTGGTTTCG GCGTAGCGCC GGCCCCGACA GCGCATGTGG AAGCCAGCAC CGACGGCGTC ACGTTCAGCA AGGTGGCTGA TCTCCCGGCA TCCGGGTCAC CTGTCCGGTC CGCAAGTTTC CCCACGATCA CAGCACGCTG CTTTCGGCTG GTCCTGGAGA CCGGAAAAGG CCACGAATTC CCCCTGGTAC AGGGGATAAA ACCACTGCCT TTTGCAGCTT CCAAGGGCGG CGGGACGTTC AAGGTTTCAG CTTTCCAGCT GTTCTCCGGC GGGCGCGTGG CCAGGTCGGA GGAAAAGGCC GGCTACGCGC CCGTGCCCGA TTACTACGCG CTCGACGGCA GGACTTGCCA TACGTCCGAC GCCGTCCAGC CGCAGGACAT CATTGATGTC ACCGCGTTCC TTGGACCCGA TGGAATTCTC GACTGGACAC CGGACAGCGG AGATTGGACC ATCCTGCGGT ACGGGTATTC CCTGACCGGG CACCTCAACG CCCCGGCCCC CGTTGACGCT ACCGGCTTGG AGGTGGACAA GCTGGACGCC GCTTTGGTTA CCCGGTACTT CAGCGATTAC CTGGGTTTCT TCGAAGAAGC CCTCGGCAGC GGCCTGGACG GGGTGTCTGC GCTGCTCAGC GACAGCATCG AATCAGGGCC CCAGAACTGG ACCGGTGCCA TGCGTGCCGA GTTCATGAAA CGCCGCGGCT ACGATCTGCT GCCGTGGCTT CCTGCCATCA GCGGCATCGT TGTTGGCAGT GCGGAACAAA GCGATTCCTT CCTGTGGGAC CTCCGGAAGA CCATCTCCGA ACTCCTCGCG GAGAACCACT ATGGAACCAT CGCGGACATC GCCAGGGAAC GGGGCCTGAC GTATTATGCG GAGGCTTTGG AGGACCACCG GCCGCAGCTG GGCGACGACG TCGGGATGCG TTCCTACGCT GATGTTCCCA TGGGTGCCAT GTGGTGCTAC GAGCCGGATA AGGGACCCCA GCCCACCTAC GTTGCCGATC TTCGTGGGGC CGCGTCGGTT GCCCACGTGT ACGGAAAGGC AGCCACCGGG GCAGAATCCA TGAGCGCCTT TGGCAAACCG TTCGCTTTTG CACCCCGGAC GCTCAAACCA ATCGCGGATC TGGAGTTCGC TTTGGGGGTC AACCTCCTGA ACATCCACAC TTCACCCCAT CAGCCCGAGG CAGTACCCAA GCCCGGAGTC ACCCTGTCGC CTTACCTTGG ACAGTCCTTT ACCCGGAATG AGACCTGGGC GCACGCGGCA AAGCCGTGGC TCGATTACCT GGGCCGCTGC AGCTACCTGC TGCAGCAGGG TATTTACGCC GCCGACGTCG CCTACTTTTA CGGTGAGGAA GCGCCCGTCA CCGGCGTTTT CGGCGATACC GCCCCCGAAG TCCCGGCAGG CCACGGCTTT GATCTCATCA ACCTGGACGG GCTGCTCAAC CACGTCACCG TAACGCCCGC TGGCGGCCTT CTCACCACTG GCGGCACCAC CTACCGGCTC CTTTATCTCG GAGGGACCAG CCACCGGATG ACGCTCCGGG CGGTTCGGCG ACTCATCGAA CTGCTGCAGG ACGGAGCCCT CGTGGCCGGA TGGCGTCCCG AACGCTCCCC CAGCCAAAGC GACGACCCCG CTGAGTGGAG TGCCGCCGTC GACCTTCTTT GGGGCGGACA TCCGGGCTTG ATTGACCTCG CGGGGGTGGC TGCAGAGGAG GGCGTGTCCA CTGCGTTGGC CCGGGCGGGG GTGGAGCCCG ACTGGGTTAT GGAGGCCGCG GCGGCAGCAA ACCTGCCCGT CATCCACCGG CAGTTGCCGA ACGCAGAACT GTATTTCGTC AGCAACCAGC GCGAACGTGC CGAACAGGTC AGCGCCTCTT TCAGGGGCAC GGCGACCGCC GCCGAATGCT GGGATCCGGT GGCTGCTTCC CGGACTCCCA TCGCCTTTCG CCCGGAGGCC GGAAGAACCG CAGTGGAGCT GCAGCTGGAA CCGTTCGGTT CGGCCTTCGT CCTCCTGCAC AGGACCGGTG GGGCAACTGT GAATGTCTCC GATACCAGCG AAGAGATTGC CGCAGCGCAC ACGCTTGAGG GCCCTTGGGA AGTGACCTTC GACGGCGATG GCCAGGATCC GTCCGCCCTT GTGATGCCGG GCGTGGCTCC TTGGGCCGGA CCCGGGGCCG ACGCCCACGG GCCCGACGTG ACCGGCTTCT CGGGGACAGG AACGTACCGC CACACTTTCT CAACCGATGG GATTCTGACC GGACCCGGCA AGCGGCTTCT GCTGGATCTG GGCGGGGTGA GTGAACTGGC TGAGGTCAGA GTCAACGGCA GCACTGTGGG CACTTTGTGG ACCTGCCCGT TCCGCGTCGA CGTCACAGAC GCGATCCGGG CCGGCAGCAA TGAGGTGGAA ATAGCTGTGA CCAATACCTG GTGGAACCGG CTTGCCGGCG ATGCCGCAGA GGGGAACTTC GCCCGCCCTG CGGCCTCCAT CTTTGAGCCG GATGCACCAA CCATGCCTGC TGGCCTTCAC GGCCCGGTGC GGTTGCTGGT CCTGGACGAC TGA
|
Protein sequence | MSDDATPALP SAADALFPGF ADPPVTARPR VWWHWMDGNI DPAGIVKDLE WLSSSGAGGV QAFTGSMGIP QYTRERVSFR SPAWQTAISC AASTSTRLGL ELAVATSAGW SATGGPWVAP HAGMKKLVWS TTAVTAGQPA LPRLAVPPSV SGPFQDVPFG AIRNDPVGVP EFYDDVAVLA LPRRGGHFQL TPSRVVASGT TSGDRHPESL ADGTFWPTAE VVAASGTTWI TAEFDQPTHV SSVRVGLPAS RGFGVAPAPT AHVEASTDGV TFSKVADLPA SGSPVRSASF PTITARCFRL VLETGKGHEF PLVQGIKPLP FAASKGGGTF KVSAFQLFSG GRVARSEEKA GYAPVPDYYA LDGRTCHTSD AVQPQDIIDV TAFLGPDGIL DWTPDSGDWT ILRYGYSLTG HLNAPAPVDA TGLEVDKLDA ALVTRYFSDY LGFFEEALGS GLDGVSALLS DSIESGPQNW TGAMRAEFMK RRGYDLLPWL PAISGIVVGS AEQSDSFLWD LRKTISELLA ENHYGTIADI ARERGLTYYA EALEDHRPQL GDDVGMRSYA DVPMGAMWCY EPDKGPQPTY VADLRGAASV AHVYGKAATG AESMSAFGKP FAFAPRTLKP IADLEFALGV NLLNIHTSPH QPEAVPKPGV TLSPYLGQSF TRNETWAHAA KPWLDYLGRC SYLLQQGIYA ADVAYFYGEE APVTGVFGDT APEVPAGHGF DLINLDGLLN HVTVTPAGGL LTTGGTTYRL LYLGGTSHRM TLRAVRRLIE LLQDGALVAG WRPERSPSQS DDPAEWSAAV DLLWGGHPGL IDLAGVAAEE GVSTALARAG VEPDWVMEAA AAANLPVIHR QLPNAELYFV SNQRERAEQV SASFRGTATA AECWDPVAAS RTPIAFRPEA GRTAVELQLE PFGSAFVLLH RTGGATVNVS DTSEEIAAAH TLEGPWEVTF DGDGQDPSAL VMPGVAPWAG PGADAHGPDV TGFSGTGTYR HTFSTDGILT GPGKRLLLDL GGVSELAEVR VNGSTVGTLW TCPFRVDVTD AIRAGSNEVE IAVTNTWWNR LAGDAAEGNF ARPAASIFEP DAPTMPAGLH GPVRLLVLDD
|
| |