Gene Arth_3445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3445 
Symbol 
ID4444175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3879296 
End bp3880795 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content66% 
IMG OID639691269 
Productbeta-galactosidase 
Protein accessionYP_832920 
Protein GI116671987 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCACT CCGCCAAAGA CCTGGCTGCC AGGATCCCGC CGTCCTTCAC GATGGGCGTG 
GCCACAGCCG CCTTCCAGAT TGAGGGAGCC CTCGACGAGG ACGGCCGCGG GCCCTCCGGG
TGGGACGTGT TTGCCCGGAA GCCCGGCGCC ATAGTGGACG ACCACAGCCC TGTCACAGCC
TGCGACCACT ACCACCGCAT GCCTGAAGAC GTTGCGCTGA TGAAGGAACT GGGGGTGGAT
TCCTACCGGT TCTCGCTGTC CTGGTCCCGC ATCCAGCCCG GCGGCAGCGG GCCGGTCAAT
CCGAAGGGCA TCGACTTCTA CGACCGGCTG CTGGACCAGC TGCTGGCCAG CGGCATCTCA
CCCATGGTGA CGCTCTACCA CTGGGACACC CCGCTGCCGC TCGACGAGGC CGGCGGCTGG
TTGAACCGGG ACACGGCGTA CCGGCTCGGC GAGTTTGCCT CTATTGCGGC TGCGGCCTTT
GGCGACCGCG TGGCCCGCTG GGTCACCGTC AATGAGCCTG CCACCGTGAC CACCAACGGC
TACGCCCTGG GACTCCACGC GCCGGGCGAA TCGCAGTTGC TCAAAGCGCT CCCCACAGTC
CACCACCAGC TCCTCGGCCA CGGCCTGGCC ATGCAGGCAT TGCGGGCTGC CAACGTACCC
GGCGAAATAG GCATCACCAA TGTCTATTCG CCCATGGTTC CGGCCTCGAT CAACCCGCTG
GACAAGCTGA GCACGGCGCT GATGGACGTT TTCCAGAACA GGCTCTTCGC GGATCCGGTG
CTCCTCGGAA AGTACCCTGA CGTCGTCCGG GCCGCCACGT TCTTCAGCTC GTTCAGCCCC
TCCGATGAGG ACATGGACCT GATCTCGCAG CCGTTGGACT TCTACGGGCT CAACTACTAC
ATGCCAACGC GGGTTGCGGC AGGTCCCGGC GACGGCGCCG TTCCCCCGGG GATGGCCGAG
GCCATGGGGG ATGACCTCAG CGGCAGCGCT CCCGGCGCAC CGTTCCACAT CACTGAGTTT
CCGGATGCCG AGACGACTGC GTACGGCTGG CCGATCAGGC CCGACTACAT GCCCGTGGCC
CTAGCCGAGA TGGCCGAACG CTACCCGGAG CTGCCCCCCG TCTTCATCAC GGAAGGCGGA
GCGAGCTTTG AGGATGTGGT GGTCCGGGAC AAGGCCGGGG ACAGGATCAT CATTCCGGAC
GAGCGCCGCC TGCGCTATCT TGCCGAGCAC ATCTCCAGCG CGGTGGAGGC CACCAGCCCG
GGGGGACCTG CTGAATCCAT TGACCTGCGT GGCTACTACG TATGGTCCCT TCTCGACAAC
TTCGAGTGGT CCGCGGGCTA TAAACAGCCC TTCGGACTCC TGCATGTTGA CTTCGAGACG
ATGGCCAGAA CGCCCAAGGC GTCCTACTAC TGGCTGCAGG AGCTGCAGGA AGCCCGCAAG
GAAGCCGCGG CTGAGGCTGT TGGCGAGAGT GCAGCCGGGG AACCGGAGCC CGTAGCCTGA
 
Protein sequence
MKHSAKDLAA RIPPSFTMGV ATAAFQIEGA LDEDGRGPSG WDVFARKPGA IVDDHSPVTA 
CDHYHRMPED VALMKELGVD SYRFSLSWSR IQPGGSGPVN PKGIDFYDRL LDQLLASGIS
PMVTLYHWDT PLPLDEAGGW LNRDTAYRLG EFASIAAAAF GDRVARWVTV NEPATVTTNG
YALGLHAPGE SQLLKALPTV HHQLLGHGLA MQALRAANVP GEIGITNVYS PMVPASINPL
DKLSTALMDV FQNRLFADPV LLGKYPDVVR AATFFSSFSP SDEDMDLISQ PLDFYGLNYY
MPTRVAAGPG DGAVPPGMAE AMGDDLSGSA PGAPFHITEF PDAETTAYGW PIRPDYMPVA
LAEMAERYPE LPPVFITEGG ASFEDVVVRD KAGDRIIIPD ERRLRYLAEH ISSAVEATSP
GGPAESIDLR GYYVWSLLDN FEWSAGYKQP FGLLHVDFET MARTPKASYY WLQELQEARK
EAAAEAVGES AAGEPEPVA