Gene Achl_1749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_1749 
Symbol 
ID7293209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp1974377 
End bp1976551 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content69% 
IMG OID643590158 
Productglycoside hydrolase clan GH-D 
Protein accessionYP_002487818 
Protein GI220912509 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.000000205426 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATCCCC TGCACCTCCG TTCCGCCGGC ACCAGCCTGC TGATCAGCTT CGACAGCGGG 
GAGGCCGAGG TCATCCACTG GGGCGCGGAC CTGGGTGCAG CACTTCCAGA CCTCGCCGTC
CTCACCACGC CCGTGGGGAA TTCCTCCGTG GACGCCCGCG TCCCGGCCGC CCTTCTCCCT
CAGGCCTCCT CGTCCTGGCG GGGCCGTCCC GCCCTCCGCG GCAACCGGAT CACCGACGGC
GCCCCGGGCC TGGACTTCTC CTCCCGCCTC CGCGTCACGT CCGTGGACAC CGCAGGCGCC
GGGCGGGCCA CCGTTGTCCA GGCCGACGCC GACACCGGCA TCAGCGTCGC CACCGTCCTG
ACACTGCACC CGGGCGGCCT GCTGGAACTG CGGCACACCC TTACCAACAA CGGCACCACC
CCCTTCCAGG TGGACGAACT GGCGACGGTC CTCCCCGTGG CGCCGGACGC CGTCGAACTC
CTCGACCTGA CCGGACGCTG GTGCCGCGAA CGCCACCCGC AGCGGCGGCC CATCCAGCAG
GGCACATGGG TCCGCTCCGG ACGGCACGGC CGTACTGGCC ACGATTCCTC TCTCCTGTTC
GCCGCCGGCT CCCAGGGTTT CGGCAATCGG CACGGCAAGG TCTGGGCCAC CCACCTGGCA
TGGAGCGGCA ACCACGAACA GTTCGCGGAC ACCATCGGCG ACGGGCGGAC CATGATCGGG
GGCTCGGAAC TGCTGGGCTC CGCGGAGGTC ATCCTCGCCC CGGGCGGCAG CTACACCACA
CCTGCCTTGT TCGCTGCATT TTCGGACCGC GGCCTGGACG GCATCACCGA GGCCTTCTAC
AGCTGGTTCC GGTCCCGCCC GCACCACATC CTTCCGGGCG CGAAGCCGCG CCCGGTGGTG
CTGAACACCT GGGAGGCCGT GTACTTCAAC CACGATCTCG ACACCCTGAT CGAGCTCGCC
GATTCCGCCG CGGACCTCGG CGTGGAGCGC TTCGTGCTCG ACGACGGCTG GTTCCGCGGC
CGCCGCGACG ACCACGCGGG CCTGGGCGAC TGGTACGTGG ACGAGACCCT GTGGCCTGAA
GGGCTGACGC CGCTGATCGA TGCCGTCACC TCCCGGGGGA TGGAGTTCGG GCTGTGGGTG
GAACCGGAGA TGGTCAACCT GGACTCGGAT GTTGCCCGGG CCCACCCGGA GTGGATTTCC
GGCCCTTCCG CGGTAGCGCA CAAGGACGGC GGCCGGCTGC CCCTCGAATG GCGCAACCAG
CACGTCATCG ACCTCGTCAA CCCGGAAGCC TGGCAGTACG TCTTCGACCG GATCTCGGCA
CTCCTGGGTG AGAACAACAT CAGCTACCTC AAGTGGGACC AGAACCGGGA CATCGTTGAA
CAGGGCCACG CCGGCCGCGC GTCCGTCCAT GAACAAACCC TCGCCGCCTA CCGGCTGTTC
GACGAGCTCC GCAAGGCCCA TCCCGGCGTC GAAATTGAAA GCTGCTCCTC CGGCGGCGGA
CGCGTTGACC TGGGCATCCT GGAGCGGACC GGACGCGTCT GGGCATCGGA CTGCAACGAC
GCCCTGGAAC GGCAGACCAT CCAGCGCTGG ACCGGCGCCG TGGTTCCGCC GGAGCTCGTG
GGCAGCCACA TCGGCCCCAC CACCTCGCAC ACCACCGCCC GCACCCACGA CCTTTCCTTC
CGTGCCATCA CAGCCTTCTT CGGCCACTTC GGCATGGAAT GGGATGTCCG CGGCGTCCAG
GGCGCAGAAC GGGAGGAGCT GCGGCGCGTT GTCGGGCTCT ACAAGGAACA CCGGGACCTG
ATCCACAGCG GCCGGCCGGT CCACGCGGAC ATCGCTGATG AGGCCTACCA GCTGCACGGC
GTGGCGTCTG GGGAACCTGC TGCGGAGGGT ACGACGGCGG CGCTGTTCGC CTTCGTCTGT
GCCCGCACCT CGGGTGCCGA GCAGCCGGGC CGGATGGGCC TGCCCGGACT CGACCCGGAC
CGGACCTACC GGGTGGACCC CATCTTCCCG GCCCCCGGCG ACAGCGACTA CGGACACACG
TTCACCCAGG TGCAGCCGCC GGCCTGGCTC AGTGACGGCG CAACAGCCAG CGGCCGGTTC
CTCGCCGAAG TTGGCCTGCC GATGCCGATG CTCAACCCGG AACACGCGGT GCTGATCAAG
GTCACAGCGC TCTAG
 
Protein sequence
MDPLHLRSAG TSLLISFDSG EAEVIHWGAD LGAALPDLAV LTTPVGNSSV DARVPAALLP 
QASSSWRGRP ALRGNRITDG APGLDFSSRL RVTSVDTAGA GRATVVQADA DTGISVATVL
TLHPGGLLEL RHTLTNNGTT PFQVDELATV LPVAPDAVEL LDLTGRWCRE RHPQRRPIQQ
GTWVRSGRHG RTGHDSSLLF AAGSQGFGNR HGKVWATHLA WSGNHEQFAD TIGDGRTMIG
GSELLGSAEV ILAPGGSYTT PALFAAFSDR GLDGITEAFY SWFRSRPHHI LPGAKPRPVV
LNTWEAVYFN HDLDTLIELA DSAADLGVER FVLDDGWFRG RRDDHAGLGD WYVDETLWPE
GLTPLIDAVT SRGMEFGLWV EPEMVNLDSD VARAHPEWIS GPSAVAHKDG GRLPLEWRNQ
HVIDLVNPEA WQYVFDRISA LLGENNISYL KWDQNRDIVE QGHAGRASVH EQTLAAYRLF
DELRKAHPGV EIESCSSGGG RVDLGILERT GRVWASDCND ALERQTIQRW TGAVVPPELV
GSHIGPTTSH TTARTHDLSF RAITAFFGHF GMEWDVRGVQ GAEREELRRV VGLYKEHRDL
IHSGRPVHAD IADEAYQLHG VASGEPAAEG TTAALFAFVC ARTSGAEQPG RMGLPGLDPD
RTYRVDPIFP APGDSDYGHT FTQVQPPAWL SDGATASGRF LAEVGLPMPM LNPEHAVLIK
VTAL