Gene Achl_3820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_3820 
Symbol 
ID7295308 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp4263886 
End bp4265769 
Gene Length1884 bp 
Protein Length627 aa 
Translation table11 
GC content68% 
IMG OID643592230 
ProductPectinesterase 
Protein accessionYP_002489862 
Protein GI220914553 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG2755] Lysophospholipase L1 and related esterases
[COG4677] Pectin methylesterase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCGAT TCACGCCCAG CCGGCGCACC GTGCTCGCTT CGGCTGCAAC CGGCGCCCTT 
ACCGCTGCGG CAGTCCTGGC CGGAGCAGCC CCGGCCGCGC ACGCAGCCGG CGGCCCCGCC
CGGGCGCAGT CCAACAGGAA ACCGGTGATC TTCGTAGTGG GGGATTCCAC CTCCTCCGCC
TACCAGCGCT CGGAACGTCC CCGCGCCGGC TGGGGCCAGG CCCTGCCGCT GCTGCTCGGG
CCGCAGGCGA CGGTGTTCGA CTACGCCTGG TCCGGCGCCT CCTCGAAAAG CTATGCCGAC
GCCGGCCTGC TGGGCGAGGT GGCGGAGGCG CTGCAACCGG GCGACTACCT GCTCATCTCC
TTCGGCCACA ACGACGAAAA AGTCGAGGAT CCGTCACGCG GCACCGATCC GCAAACCACG
TTCAAGGAGT ACCTGGGCCG GTACCTCGAC GCCGCATCAG CCGCAGGCGC CAAAGCCGTC
CTGGTCACCC CCGTGGAACG GCGCCGCTTC AGCGCCCTCG GCGTTGCCCA GGACACCCAC
GGCGCCTACC CCCAGGCGAT CAAGGAGCTC GCTGCGTCCC GCGGTGTTCC CCTGGTTGAC
CTGACGGCAT CCTCCAAGGA GCTCTGGCAA AAGCTGGGGC CGGAGGGAAC CACCTCGCAC
TTCCTGCATG CCACTCCCGG CCAGTACCCG CAGTACCCCA ACGGCGTCAC GGACAACACC
CACTTCCAGG CGCAGGGCGC CTTGGCCGTG GCACAGCTTG TTGTCGCAGG ACTGCAGGCG
CAGCCGGCCG CACCGCCGGG ATGGTTCCGG GAGCCCGGCG GGACGCTGGA CCCGCTCTCA
GCCATCTACT GGCCCAGCGA ACGCCCGGTG GACAAGCCCC TGGTCCTCAC CGTGGGAAAC
GGCGGGCAGT TCTCCACCGT CCAGGCCGCC GTCGACGCAG CACCGGCAAA CAGCACCCGC
CGTACGGAAA TCAGGATCGC GCCCGGGACG TACCGCGAGC TGATCCGGGT ACCGGCCAAC
AAACCGAGGG TGTCGTTCAT CGGACGCGGG CAGGCGCCGG AGGACGTGGT GCTGGTCTAC
AACAATGCGT CCGGCACCCC GAAACCCGAC GGCACGGGGA CCTATGGCAC CGGCGGGAGC
ACCAGCGTGC GGATCGACGG CCCGGATTTC ACGGCAGTGA ACCTGACGTT CAGCAACGAC
TTCGACGAAG CCGCTAATCC GGACATGAAG AACCGGCAGG CCGTGGCACT GTTCCTGGCC
GGGGACCGGG CTGTCCTCCG CAATATCCGC TGCCTGGGCA ACCAGGACAC ACTGCTGGTG
GATTCCCCCG CCCGCGGTGT GGAGGCACGC TCCTACTTCA CGGACTCCTA CGTTGAAGGC
GACGTGGATT TCATCTTCGG CCGGGGAACT GCCGTGTTCT CCGGCTGCCG GATCCGCTCC
CTGGACCGCG GGTCCAGCAC CAACAACGGA TACGTCTCCG CCGGCAGCGT CAATATCGGG
ATCAAGCACG GATACCTGTT CACCCAGTGC CGCTTCGTCT CGGATGCAGC GGCCGGATCC
GTCCATCTCG GCCGGCCATG GCATCCCAGT GGAGATCCGG AGGCCATTGC CCAGGTGCTG
GTCCGCGACT CGTGGCTGGG AGCGCACATC TCCGGAACGC CGTGGACGGA CATGAGCGGT
TTTTCGTGGC AGGAGGCCAG ATTCCACGAG TTCAACAACA ATGGCCCCGG ATCACGGGCA
ACCCCTGACC GGCCGCAACT GGATCCCCGC CTTGCAGAGG AGTTCACTGC TGGAGCGTAC
CTGCACGGGG CAGATGCCTG GGCGCCCCAG CTGGCCGGAA GCAAGGCCGA CAGTACCGCC
CTGAGCGTCG GCCAGCCCGC GTAA
 
Protein sequence
MRRFTPSRRT VLASAATGAL TAAAVLAGAA PAAHAAGGPA RAQSNRKPVI FVVGDSTSSA 
YQRSERPRAG WGQALPLLLG PQATVFDYAW SGASSKSYAD AGLLGEVAEA LQPGDYLLIS
FGHNDEKVED PSRGTDPQTT FKEYLGRYLD AASAAGAKAV LVTPVERRRF SALGVAQDTH
GAYPQAIKEL AASRGVPLVD LTASSKELWQ KLGPEGTTSH FLHATPGQYP QYPNGVTDNT
HFQAQGALAV AQLVVAGLQA QPAAPPGWFR EPGGTLDPLS AIYWPSERPV DKPLVLTVGN
GGQFSTVQAA VDAAPANSTR RTEIRIAPGT YRELIRVPAN KPRVSFIGRG QAPEDVVLVY
NNASGTPKPD GTGTYGTGGS TSVRIDGPDF TAVNLTFSND FDEAANPDMK NRQAVALFLA
GDRAVLRNIR CLGNQDTLLV DSPARGVEAR SYFTDSYVEG DVDFIFGRGT AVFSGCRIRS
LDRGSSTNNG YVSAGSVNIG IKHGYLFTQC RFVSDAAAGS VHLGRPWHPS GDPEAIAQVL
VRDSWLGAHI SGTPWTDMSG FSWQEARFHE FNNNGPGSRA TPDRPQLDPR LAEEFTAGAY
LHGADAWAPQ LAGSKADSTA LSVGQPA