Gene Achl_3719 details


Gene Information

Locus tag: Achl_3719
Symbol:
ID: 7295204
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 4145822
End bp: 4148836
Gene Length: 3015 bp
Protein Length: 1004 aa
Translation table: 11
GC content: 67%
IMG OID: 643592128
Product: glycoside hydrolase family 2 TIM barrel
Protein accession: YP_002489763
Protein GI: 220914454
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
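
The start and end coordinates, gene length and protein length listed above are mutually consistent, which a short check can confirm. The sketch below is in Python; the constant and function names are illustrative and not part of the database record. It assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length.

```python
# Values copied from the Gene Information fields above.
START_BP = 4145822
END_BP = 4148836
GENE_LENGTH_BP = 3015
PROTEIN_LENGTH_AA = 1004


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Each line prints whether the derived value matches the listed one.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: 4148836 - 4145822 + 1 = 3015 bp, and 3015 / 3 - 1 = 1004 aa.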


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 129
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGTCC AGTCCCCCAC AACGGTGTCG AATGCTGCTG CAAGTGATGC CTCCGACTCT 
GTGCCGCGCA ATCTTGAACT CGGCGCAGGA GGCGTTCTCG AGCTCTCCTC AGTGGCTCCG
GGGCGGGGTG CGCTTGCTCC CCGGGCCTAT CTCGCCAGCG ATGCGCCGCG GCTGTCCCTG
AACGGCGACT GGCAGTTCCG CCTCAGCCCG GGCATCCGCA GCGCCCCCGC CGAGGGGTGG
CAGCTGGGCC AGGATCTGGA CGGCTTCGAA ACCCTGCCCG TTCCGTCCAG CTGGCCCATG
CACGGGCACG GCGCACCGGC GTACACCAAC ATCCAGTTTC CGTTTGCCGT GGAACCCCCG
CATGTGCCGG AAGCCAACCC TATTGGTGAC CACCTCCTGA CGTTCCACGC CGGCCCGGAG
TTCTTCCCCA ACGCACTGCT GCGTTTCGAC GGCATCGACT CCGCCGGCAC GGTGTGGCTG
AACGGCGTCG AACTCGGCAC CACCCGCGGC AGCCGCCTTG CCCACGAATT CGACGTCTCA
GGAATCCTTA CGGAAGGCAG CAACACCCTC GCCGTGCGGG TGGCGCAGTT CTCGGCCGCC
AGCTACGTGG AGGACCAGGA CATGTGGTGG CTCCCGGGGA TCTTCCGGGA CGTCACCCTT
CAAGCCCGCC CTGCGGACGG CATAGACGAC GTCTTCGTCC ACGCTGACTA CGACGCCGGC
ACAGGCGAAG GTGTGCTGCG CGTGGAGGCG AGCCGGGGCG GAAAAGCGAT TGACGCCGTC
GTACGCGTTC CCGAACTGGA CCTGGAGCTT AAGGCCGGCC AGGAGCACCG CCTGCCATCC
GTAGCGCCGT GGTCGGCAGA GGAGCCCCGC CTGTACGAAG CCACGGTCAG CACCCCGGCC
GAAACGGTGT CCTTGCAGCT GGGATTCCGC AGCATCACCA TCGAAGACTC CCAGTTCAAG
GTGAACGGGC GGCGGATCCT GCTGCGCGGG GTGAACCGCC ACGAGCACCA TCCCCGGCTG
GGGCGCGTGG TTCCGCAGGA GGTGATGGAG GCGGAGCTCA AGCTCATGAA GCAGCACAAC
ATCAACGCAA TCCGCACATC GCACTACCCG CCCCACCCAA AATTCCTTGC CCTGGCTGAC
CAGCTTGGTT TCTACGTCGT GCTCGAGTGC GACCTGGAAA CGCACGGGTT CGAGAGCGCC
GGCTGGGCCC AGAACCCGAG CGATGATCCC CAATGGGAGG ACGCCCTGCT GGACCGGATG
CGCCGGACGG TGGAGCGGGA CAAGAACCAT GCAGCCGTCG TGATGTGGTC GCTGGGCAAC
GAATCCGGGA CCGGCCGGAA CCTGGCCGCG ATGTCGCGCT GGACCAAGGA CCGCGACCCG
TCCCGCCCCA TCCACTACGA GGGTGACTGG TCCTCACCCT ACGTTGACGT GTACTCCCGC
ATGTACGCCA ACCAGGCCGA GACTGCACTG ATTGGACAGG GAACTGAGCC TCCACTCGAT
GACCCCGAGC TGGATGCCCG GCGCCGTGCC ATGCCCTTCG TGCTGTGCGA ATATGTTCAC
GCCATGGGCA ATGGCCCGGG AGGAATGACC GAGTACCAGG AACTCTTTGA ACGCCACCCC
AGGCTGATGG GCGGCTTCGT CTGGGAGTGG CTGGAACACG GAATCGCCGT CCCCACGCCG
GGTGGCGGTG AGCACTTCGC CTATGGCGGT GATTTCGGTG AGGAAATTCA CGACGGCAAC
TTCGTCACCG ACGGACTGGT GGACGCCAAT CGCCAACCCC GGCCCGGCCT CCTCGACTTC
AAGCGCGTCG TGGCGCCGCT GCGGATCCAG GTTGCCGGGG ACTGGTCCGG CTTCACCGTC
CACAACGGAC ACGATTTCAC CGACACGTCC CCGTTCAGCT TCCAGTACAC GGTGGAGGCC
GACGGCGACA CTATCGACGG CGGCACGGTG GAGGTTCCGC GCGTCGCGCC GCAGTCCGCG
GCCACAGTTG CGTTGCCCGC CGGCCTTCCG CACACGGACG GGCCGGCAGT CCTCACCGTC
AGCGCCGTTC TCACCGCTGC GACGTCCTGG GCCGGCACCG GCCACGAGAT CGCCTGGGGC
CAGTCCGTCC GTGGCGTCGC GCGTGTTGAA GAGCCCCGCG CTGCTGAATC CGTGGCCGCC
ACTGACGATG AGCTGCGGCT GGGCCCGGCG GTGTTCAGCA GGCTGACGGG GATGCCCACA
CGCATAGGCG GCATCAGTGT GGCAAAGCTG GGGCTCTCGC TGTGGTGGCC CCCCACCGAC
AATGACCTGG GCAGGGAATG GGGCGGCGCC GACCTCCGGC CCCTGGCCAC CCAATGGAAG
GATGCAGGGC TTGACCGGCT CCATACCCGG CTGCTTGGCA TTAGCGCTGA ACGCACCGAC
GATGGCGGGG AGCGGCTCGT CACGAGGACC AGGGTGGGTG CCGCGGACAA ACAGTTCGGG
GTCCTTGTGG ACTACACGTG GACCAGTGAC GGGGAATCGC TGGCGCTCCG AACCCAGGTC
CGGCCGCAGG GATCCTGGGT CAACGCCGGA TTTGAGGTGG AGTGGGCCCG GATCGGCCTG
GAACTGGTGC TGGACGGCGG GACGTCAGCG GTGAGCTGGT TCGGGCAGGG ACCGCACCAG
GCTTATCCGG ATACGGGGCA AGGCACGCGG ACGGGATGGT TTTCCATGCC CTTGGGTGAC
CTCGATGTTG AGTATGCGCG GCCCCAGGAG TCCGGTGCCC GCGCCGGTGT CCGCTCGGCT
GCCCTTGAGG TGGATGGCGC CACGTTGGAC ATCGCCGGCG AGCCCTTTGC CCTGACGGTA
CGCCCCTACA GCCAGGCCGC ACTCAACGAA GCAACCCACC GGCCTGACCT GGCAGCCGAC
GGGCGGACCT ACATCTACCT CGACAACGCC ATGCGTGGTG TGGGCACCGG AGCCTGCGGT
CCGGGTGTCC TCGATCCGTA CCGCCTGCAG CCACGGGATG CCGACTTCAC GGTAGTGCTG
CGGGTCCGTC CGTAG
 
Protein sequence
MPVQSPTTVS NAAASDASDS VPRNLELGAG GVLELSSVAP GRGALAPRAY LASDAPRLSL 
NGDWQFRLSP GIRSAPAEGW QLGQDLDGFE TLPVPSSWPM HGHGAPAYTN IQFPFAVEPP
HVPEANPIGD HLLTFHAGPE FFPNALLRFD GIDSAGTVWL NGVELGTTRG SRLAHEFDVS
GILTEGSNTL AVRVAQFSAA SYVEDQDMWW LPGIFRDVTL QARPADGIDD VFVHADYDAG
TGEGVLRVEA SRGGKAIDAV VRVPELDLEL KAGQEHRLPS VAPWSAEEPR LYEATVSTPA
ETVSLQLGFR SITIEDSQFK VNGRRILLRG VNRHEHHPRL GRVVPQEVME AELKLMKQHN
INAIRTSHYP PHPKFLALAD QLGFYVVLEC DLETHGFESA GWAQNPSDDP QWEDALLDRM
RRTVERDKNH AAVVMWSLGN ESGTGRNLAA MSRWTKDRDP SRPIHYEGDW SSPYVDVYSR
MYANQAETAL IGQGTEPPLD DPELDARRRA MPFVLCEYVH AMGNGPGGMT EYQELFERHP
RLMGGFVWEW LEHGIAVPTP GGGEHFAYGG DFGEEIHDGN FVTDGLVDAN RQPRPGLLDF
KRVVAPLRIQ VAGDWSGFTV HNGHDFTDTS PFSFQYTVEA DGDTIDGGTV EVPRVAPQSA
ATVALPAGLP HTDGPAVLTV SAVLTAATSW AGTGHEIAWG QSVRGVARVE EPRAAESVAA
TDDELRLGPA VFSRLTGMPT RIGGISVAKL GLSLWWPPTD NDLGREWGGA DLRPLATQWK
DAGLDRLHTR LLGISAERTD DGGERLVTRT RVGAADKQFG VLVDYTWTSD GESLALRTQV
RPQGSWVNAG FEVEWARIGL ELVLDGGTSA VSWFGQGPHQ AYPDTGQGTR TGWFSMPLGD
LDVEYARPQE SGARAGVRSA ALEVDGATLD IAGEPFALTV RPYSQAALNE ATHRPDLAAD
GRTYIYLDNA MRGVGTGACG PGVLDPYRLQ PRDADFTVVL RVRP
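
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Python sketch below is a minimal illustration, not the tool used to produce this record. It builds the codon table from the NCBI base ordering (translation table 11 assigns the same amino acids to codons as the standard table), and the helper names `clean`, `translate` and `gc_fraction` are hypothetical. It does not handle the alternative start codons that table 11 allows, which is not a problem here because the gene starts with ATG. Only the first 30 bases of the gene are used as input; the full sequence can be pasted in the same way.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Translation table 11 uses the same codon-to-amino-acid assignments
# as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, copied from the record.
    excerpt = clean("ATGCCTGTCC AGTCCCCCAC AACGGTGTCG")
    print(translate(excerpt))  # MPVQSPTTVS, the first 10 residues listed
    print(round(gc_fraction(excerpt), 2))
```

Running `gc_fraction` on the full cleaned gene sequence gives the value to compare against the GC content field in the record.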