Gene Achl_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAchl_0421 
Symbol 
ID7291848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter chlorophenolicus A6 
KingdomBacteria 
Replicon accessionNC_011886 
Strand
Start bp442297 
End bp443556 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content66% 
IMG OID643588817 
Productglycoside hydrolase family 1 
Protein accessionYP_002486509 
Protein GI220911200 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACC AGTTTCCGCA GGACTTCCTT TGGGGCGTGG CCACCGCGGG CCACCAGGTG 
GAAGGCAACA ACGTCAACAG CGATACCTGG TTCCTGGAGC AGCTGCCCGG GAGCATGTTT
TCTGAGCCGT CCGGAGACGC TGTGGACCAC TACCACCGCT ACCGCGAGGA CATTGCCCTG
ATTGCGGAGC TGGGCTTCAC CAGCTACCGC TTCTCCCTCG AGTGGGCACG GATCGAACCG
GCCGAAGGCC AGTTCTCCGT GGCCGCGCTG GACCATTACA AGCGTGTCCT GGAGGCCTGC
GTCGAGCACG GGCTCACCCC GGTGGTGACG TTCCACCACT TCGCCTCACC GCTGTGGCTG
CTGCAGTCCG GCGGGTGGGA GGGGGCCCGC ACGGCAGAGC TGTTTGCCCG GTACTGCGAC
CGCGCCATGA CTCACCTGGG CCACCTCATC GGCGTCGCCT GCACGCTGAA CGAACCCAAC
CTGCCCTGGC TCCTGGAGTC CTTCGGCATT GGCGGGGAAG CCCCGGAAAA CCGCGGATCC
GTACCCGTCT GGGCCGCCGC AGCGGAACGC CTGGGCGTTG ACCCGAGCAG CGTTGCCCCG
TTCCAGTTCT GCTCCACTGA GGCCGGGTTC GCCGTGAAGC TGGCCTCGCA CCAGGCCGCT
ACCGCCGTGA TCAAGGCGCA CCGCCCGGAC CTGCGGGTGG GCTGGACACT CGCCAATTCC
GACATCCAGT CCATCCCTGG CGGCGAAGCG ATTGCTGACA AGGTGCGCCG CGACGTCAAT
GAACGGTTCC TTGAAGCTTC CCGTGGCGAC GATTTTGTGG GCATCCAGAC GTATGGCCGC
ACGGTGTACG GCCCCGAAGG CCACGCCCCG GCGCCGGATG GCGTGGAAAC CAACCAGATG
GGTGAGGAAA TCTACCCGCA GGGCCTGGAA GCGACCATCC GGGAGGCAGC ACGGATAGCG
GGTATCCCGG TAATCGTCAC GGAAAACGGC CTGGCCACGG AAGATGACAC CCAGCGGCTG
GCCTACCTGC AGACCGCAGT CGAGGGCGTC GCGTCCTGCC TTGCCGACGG CATCGAAGTG
GGCGGGTACA TCGCCTGGAC AGCATTCGAC AACTACGAAT GGGTGTTCGG GTACCGGCCC
AAGTTCGGAC TGATCGCCGT AGACCGCACC ACCCAGGAGC GCACCCCCAA GGAGAGCGCG
CACTGGTTGG GCAGCTTCGC CCGGGAACAT GCGGCCTCCC AGGTGGCCCA ACCCGCCTGA
 
Protein sequence
MTNQFPQDFL WGVATAGHQV EGNNVNSDTW FLEQLPGSMF SEPSGDAVDH YHRYREDIAL 
IAELGFTSYR FSLEWARIEP AEGQFSVAAL DHYKRVLEAC VEHGLTPVVT FHHFASPLWL
LQSGGWEGAR TAELFARYCD RAMTHLGHLI GVACTLNEPN LPWLLESFGI GGEAPENRGS
VPVWAAAAER LGVDPSSVAP FQFCSTEAGF AVKLASHQAA TAVIKAHRPD LRVGWTLANS
DIQSIPGGEA IADKVRRDVN ERFLEASRGD DFVGIQTYGR TVYGPEGHAP APDGVETNQM
GEEIYPQGLE ATIREAARIA GIPVIVTENG LATEDDTQRL AYLQTAVEGV ASCLADGIEV
GGYIAWTAFD NYEWVFGYRP KFGLIAVDRT TQERTPKESA HWLGSFAREH AASQVAQPA