Gene Information |
Locus tag | Achl_3820 |
Symbol | |
ID | 7295308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 4263886 |
End bp | 4265769 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643592230 |
Product | Pectinesterase |
Protein accession | YP_002489862 |
Protein GI | 220914553 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG2755] Lysophospholipase L1 and related esterases [COG4677] Pectin methylesterase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
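The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag Achl_3820.
length = gene_length_bp(4263886, 4265769)
print(length)                     # 1884
print(protein_length_aa(length))  # 627
```

The strand field ("-") does not affect these lengths; it only indicates that the listed gene sequence is read from the reverse strand of the replicon.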
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCGAT TCACGCCCAG CCGGCGCACC GTGCTCGCTT CGGCTGCAAC CGGCGCCCTT ACCGCTGCGG CAGTCCTGGC CGGAGCAGCC CCGGCCGCGC ACGCAGCCGG CGGCCCCGCC CGGGCGCAGT CCAACAGGAA ACCGGTGATC TTCGTAGTGG GGGATTCCAC CTCCTCCGCC TACCAGCGCT CGGAACGTCC CCGCGCCGGC TGGGGCCAGG CCCTGCCGCT GCTGCTCGGG CCGCAGGCGA CGGTGTTCGA CTACGCCTGG TCCGGCGCCT CCTCGAAAAG CTATGCCGAC GCCGGCCTGC TGGGCGAGGT GGCGGAGGCG CTGCAACCGG GCGACTACCT GCTCATCTCC TTCGGCCACA ACGACGAAAA AGTCGAGGAT CCGTCACGCG GCACCGATCC GCAAACCACG TTCAAGGAGT ACCTGGGCCG GTACCTCGAC GCCGCATCAG CCGCAGGCGC CAAAGCCGTC CTGGTCACCC CCGTGGAACG GCGCCGCTTC AGCGCCCTCG GCGTTGCCCA GGACACCCAC GGCGCCTACC CCCAGGCGAT CAAGGAGCTC GCTGCGTCCC GCGGTGTTCC CCTGGTTGAC CTGACGGCAT CCTCCAAGGA GCTCTGGCAA AAGCTGGGGC CGGAGGGAAC CACCTCGCAC TTCCTGCATG CCACTCCCGG CCAGTACCCG CAGTACCCCA ACGGCGTCAC GGACAACACC CACTTCCAGG CGCAGGGCGC CTTGGCCGTG GCACAGCTTG TTGTCGCAGG ACTGCAGGCG CAGCCGGCCG CACCGCCGGG ATGGTTCCGG GAGCCCGGCG GGACGCTGGA CCCGCTCTCA GCCATCTACT GGCCCAGCGA ACGCCCGGTG GACAAGCCCC TGGTCCTCAC CGTGGGAAAC GGCGGGCAGT TCTCCACCGT CCAGGCCGCC GTCGACGCAG CACCGGCAAA CAGCACCCGC CGTACGGAAA TCAGGATCGC GCCCGGGACG TACCGCGAGC TGATCCGGGT ACCGGCCAAC AAACCGAGGG TGTCGTTCAT CGGACGCGGG CAGGCGCCGG AGGACGTGGT GCTGGTCTAC AACAATGCGT CCGGCACCCC GAAACCCGAC GGCACGGGGA CCTATGGCAC CGGCGGGAGC ACCAGCGTGC GGATCGACGG CCCGGATTTC ACGGCAGTGA ACCTGACGTT CAGCAACGAC TTCGACGAAG CCGCTAATCC GGACATGAAG AACCGGCAGG CCGTGGCACT GTTCCTGGCC GGGGACCGGG CTGTCCTCCG CAATATCCGC TGCCTGGGCA ACCAGGACAC ACTGCTGGTG GATTCCCCCG CCCGCGGTGT GGAGGCACGC TCCTACTTCA CGGACTCCTA CGTTGAAGGC GACGTGGATT TCATCTTCGG CCGGGGAACT GCCGTGTTCT CCGGCTGCCG GATCCGCTCC CTGGACCGCG GGTCCAGCAC CAACAACGGA TACGTCTCCG CCGGCAGCGT CAATATCGGG ATCAAGCACG GATACCTGTT CACCCAGTGC CGCTTCGTCT CGGATGCAGC GGCCGGATCC GTCCATCTCG GCCGGCCATG GCATCCCAGT GGAGATCCGG AGGCCATTGC CCAGGTGCTG GTCCGCGACT CGTGGCTGGG AGCGCACATC TCCGGAACGC CGTGGACGGA CATGAGCGGT TTTTCGTGGC AGGAGGCCAG ATTCCACGAG TTCAACAACA ATGGCCCCGG ATCACGGGCA ACCCCTGACC GGCCGCAACT GGATCCCCGC CTTGCAGAGG AGTTCACTGC TGGAGCGTAC 
CTGCACGGGG CAGATGCCTG GGCGCCCCAG CTGGCCGGAA GCAAGGCCGA CAGTACCGCC CTGAGCGTCG GCCAGCCCGC GTAA
|
Protein sequence | MRRFTPSRRT VLASAATGAL TAAAVLAGAA PAAHAAGGPA RAQSNRKPVI FVVGDSTSSA YQRSERPRAG WGQALPLLLG PQATVFDYAW SGASSKSYAD AGLLGEVAEA LQPGDYLLIS FGHNDEKVED PSRGTDPQTT FKEYLGRYLD AASAAGAKAV LVTPVERRRF SALGVAQDTH GAYPQAIKEL AASRGVPLVD LTASSKELWQ KLGPEGTTSH FLHATPGQYP QYPNGVTDNT HFQAQGALAV AQLVVAGLQA QPAAPPGWFR EPGGTLDPLS AIYWPSERPV DKPLVLTVGN GGQFSTVQAA VDAAPANSTR RTEIRIAPGT YRELIRVPAN KPRVSFIGRG QAPEDVVLVY NNASGTPKPD GTGTYGTGGS TSVRIDGPDF TAVNLTFSND FDEAANPDMK NRQAVALFLA GDRAVLRNIR CLGNQDTLLV DSPARGVEAR SYFTDSYVEG DVDFIFGRGT AVFSGCRIRS LDRGSSTNNG YVSAGSVNIG IKHGYLFTQC RFVSDAAAGS VHLGRPWHPS GDPEAIAQVL VRDSWLGAHI SGTPWTDMSG FSWQEARFHE FNNNGPGSRA TPDRPQLDPR LAEEFTAGAY LHGADAWAPQ LAGSKADSTA LSVGQPA
|
| |
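The GC content field can be recomputed from the gene sequence shown above. The following is a minimal sketch, assuming the sequence is pasted as a string with the same 10-base spacing used in this record; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input (the first ten bases of the gene sequence).
print(clean_sequence("ATGCGGCGAT"))  # ATGCGGCGAT
print(gc_percent("ATGCGGCGAT"))      # 60.0
```

Applying `gc_percent` to the full gene sequence and rounding to the nearest integer should be comparable to the GC content value listed in the record, assuming that value was computed over the gene rather than the whole replicon.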