Gene Achl_0402 details


Gene Information

Locus tag: Achl_0402
Symbol:
ID: 7291829
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 424545
End bp: 426029
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 68%
IMG OID: 643588798
Product: beta-galactosidase
Protein accession: YP_002486490
Protein GI: 220911181
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
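
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 424545
end_bp = 426029

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)
```

Under these assumptions the result agrees with the listed gene length of 1485 bp and protein length of 494 aa.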


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCCA CCCAACGTCC GTTTCCGGAG GACTTCCTCT GGGGGACGGC AACAGCTGCC 
TACCAAATCG AGGGCGCAGC ACACGCCGAA GGCCGGGGGG ACTCCATCTG GGATGTGTTC
GCCCGGCTCC CCGGTGCCGT GGCGGATGGC CACAACGGGG ACATGGCCTG CGACCACTAC
CGGCGCTACC GGCAGGACGT GGCGCTCATG GGCCGGCTGA ACATGAAGGC CTACCGGTTC
TCCACCTCCT GGGCCCGCTG CATGCCTGAC GGCGTCACAC CCAATCCTGA CGGGATCGCC
TTCTACTCCC GGCTGGTGGA TGAGCTGCTG GCTGCCGGCA TCACCCCGTG GCTGACCCTC
TACCACTGGG ACCTGCCCCA GGCCCTGGAA GACAACGGCG GGTGGGCCAA CCGGGACACC
GCCTACCGCT TCGCCGACTA CGCGGCACTT ATGCACAGCG TGCTGGGGGA CAGGGTCCGC
ATCTGGACCA CCCTCAACGA ACCGTGGTGC TCGGCGTTCC TGGGTTACGC GGCAGGAATC
CACGCTCCGG GCCGGCAGGA ACCCCGGGCC GCCCTGGCTG CGGCGCACCA CCTGCTGCTC
GGCCACGGCC TGGCAGCGGC GGAGCTCCGC CGCCGGGATA CGGAGGCCTC CCTGGGCATC
ACCCTGAACC TGACAGTCTC CGACCCGAGG GATCCCGGCA GCGAAAGCGA CCGTGATGCA
GCGAGGCGGA TTGACGGGCA GTTCAACCGG ATCTTCCTTG ACCCGCTGTT CCGCGGCGAA
TACCCGGCTG ACGTCCTGGC GGACGTAGCG CACCTGGGAA TGGCGGACCT GGTGCAGGAC
GGCGACCTTG AGCTGATTGC CACGCCGCTG GATCTCCTGG GCGTCAACTA CTACCACGGC
GAATCGCTCA CCAAGGATCT GGCAGGGGCC CAGGAACAGG CGGCCCCGGA AACCACATCG
GTCCCCGGGC AGGCAACCCG CGCGGTGGCA TCGCCGTTCG TGGCGGCCGA CGGCGCCAGG
TCGGTCCGCC GGGGCCTTCC TGTCACGGGA ATGGGCTGGG AGGTGCAACC CGAAGGACTC
CGGCGCCTGC TCAACCGACT CCACACCGAG TACACGGGAC CGGCCGGGAT CCCGATCTAC
ATCACCGAGA ACGGCGCAGC CTATGACGAC GTGCCCGATG CCACAGGTTT CGTGGACGAC
CAGGACCGGC TGGGCTTCTT CGCCGCCCAT CTTGACGCCG TGCACCGGGC CATTGCGGAC
GGCGTGGATG TCCGCGGCTA CCTCGCGTGG TCACTGCTGG ACAACTTCGA GTGGTCCTTC
GGCTACCACC AGCGTTTTGG CATGGTCCGG GTGGACTATT GCACCCAGGA CCGGATTCCG
AAAGCGAGCG CGCTGTGGTA CTCATCGGTG GCATCCGGCA ACGCCCTTCC TGCGGGCAGC
AGCCCCGTCC CGCCGCCGGC CCGGGGTGTC GTATTGTCGG TGTAA
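
The GC content reported for this gene can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a plain string in the space-separated 10-base blocks shown above; `gc_content` is a hypothetical helper name, not part of any database tool.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used to group bases into blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first two 10-base blocks of the gene sequence.
first_blocks = "ATGAGTTCCA CCCAACGTCC"
print(gc_content(first_blocks))
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content field in the Gene Information section.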
 
Protein sequence
MSSTQRPFPE DFLWGTATAA YQIEGAAHAE GRGDSIWDVF ARLPGAVADG HNGDMACDHY 
RRYRQDVALM GRLNMKAYRF STSWARCMPD GVTPNPDGIA FYSRLVDELL AAGITPWLTL
YHWDLPQALE DNGGWANRDT AYRFADYAAL MHSVLGDRVR IWTTLNEPWC SAFLGYAAGI
HAPGRQEPRA ALAAAHHLLL GHGLAAAELR RRDTEASLGI TLNLTVSDPR DPGSESDRDA
ARRIDGQFNR IFLDPLFRGE YPADVLADVA HLGMADLVQD GDLELIATPL DLLGVNYYHG
ESLTKDLAGA QEQAAPETTS VPGQATRAVA SPFVAADGAR SVRRGLPVTG MGWEVQPEGL
RRLLNRLHTE YTGPAGIPIY ITENGAAYDD VPDATGFVDD QDRLGFFAAH LDAVHRAIAD
GVDVRGYLAW SLLDNFEWSF GYHQRFGMVR VDYCTQDRIP KASALWYSSV ASGNALPAGS
SPVPPPARGV VLSV
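
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is one way to reproduce that, assuming the standard codon assignments (translation table 11 uses the same assignments for elongation and differs from the standard code in its permitted start codons). The names `CODON_TABLE` and `translate` are illustrative. As a simplification, the first codon is translated like any other, which is sufficient here because the gene begins with ATG.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, dropping any trailing stop codon."""
    # Remove the spaces and line breaks used to group bases into blocks.
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = [CODON_TABLE[bases[i:i + 3]] for i in range(0, len(bases), 3)]
    # Stop codons are written as '*'; only those at the end are removed.
    return "".join(residues).rstrip("*")


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGTTCCA CCCAACGTCC GTTTCCGGAG"))  # MSSTQRPFPE
```

The printed residues match the first block of the protein sequence; translating the full gene sequence the same way allows the whole record to be cross-checked.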