Gene Arth_3149 details


Gene Information

Locus tag: Arth_3149
Symbol:
ID: 4444262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3533515
End bp: 3535506
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 68%
IMG OID: 639690975
Product: Beta-glucosidase
Protein accession: YP_832627
Protein GI: 116671694
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
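
The coordinates and lengths in this table are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3533515
end_bp = 3535506

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# which does not contribute an amino acid to the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1992
print(protein_length)  # 663
```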


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.167157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGACTTCAC AGACCCGCAC CAGCCCGCGC CCCGGAAACC CGCGCCCCGG AAACCCCCGT 
CCCGGAAACT CCGGCAACAG CACCCAAACC GCCGGCACCG GCCCCGCCAC CTCCGTGGCG
GCCGACGGCA CACGGTACCG GGACCTCAAC GGCAACGGGA TCATGGATCC CTTCGAGAAC
CCTGGACTCA GCCCGCACGA ACGCGCAGCC GACCTCGTGG CCCGGCTCAG CCTCGAAGAA
AAGGCCGGGC TGATGTTCCA CACCGTGATT GAGACCGGAC CGGGCGGATC CCTGCTCGAG
ACTCCCGGAA ACATCAGCAA GTCGCCCACC AGCACGGTGA TCCTGGGCAA GTTCATGAAC
CACTTCAACA TTCACGCCTT GGGCACTGCC CGGGAGGCCG CGATGTGGAG CAACGCCCTG
CAACAGCTTG CGGCACAGAC CCCGCACGGC ATTCCGGTCA CGATCTCCAC GGACCCCCGC
CACGCCTTCA TCGAGAACTC CGGCGTATCG TTCACCGCCG CCCACTTTTC CCAGTGGCCC
GAGCCGATCG GCCTGGCCGC CGTCGGCAGT GCCGAGCTAA TCCGCCGCTT CGCCGAGATC
GCCCGCACGG AGTACACCGC CGTCGGCATC CGTGCCGCCC TGCACCCGAC CGTCGACCTC
GCCACAGAGC CACGCTGGTG CCGGCAGGCC GGAACCTTCG GGCAGGACTC CGAGCTCAGC
TCGAAGTACG TCGTCGAGTA CCTTCAAGGA TTCCAGGGCG ACGAGCTGGG TCCGGACAGC
GTTGCCTGCA CCACCAAGCA CTTCCCGGGC GGCGGTCCGC AGCGCGACGG CGAAGACGCA
CACTTCCCCT ACGGCCGGGA ACAGGTCTAC CCCGGCGGCC GCTTCGACGA GCACCTGGCT
CCGTTCCGGG CGGCCATCGC GGAAAAGACC AGCGCCATCA TGCCCTATTA CGGCATGCCG
ATCGGCGTCG AGCTGGACGG CGAGCCGGTG GAGGAAGTCG GCTTCGGCTA CAACAGGCAG
ATCATCACCA ACCTGCTCCG GGACAAGCTC GGCTACGACG GCGTGGTCCT CAGCGACTGG
GAGCTGGTCA ACGACAACAT CGTTGGCGAG CAGGTGCTGC CGGCGAGGGC CTGGGGCGTT
GAAGAGCTCA CCGCGCCCGA GCGGATGCTG AAGATCCTCA ACGCGGGCGT GGACCAGTTC
GGCGGCGAGG AATGCACCGA GCTGCTCCTT GGCCTGGTCC GGGACGGCCT GGTCAGCGAG
GAACGAATCG ATGAGTCCGC GCGCCGCCTG CTGCTGGTCA AGTTCCAGCT CGGCCTGTTC
GACAACCCCT TCGTGGACGA GGACGCAGCG GCCGAAATCG TTGGCAACCC CGAGTTCCGG
CGGGAGGGCC ACCGCGCGCA GGCAACGTCC GTCACGGTGC TGGCCAACGG AACGCACGAC
GGCGGGACGG CCCTGCCGCT GACCGGCTCA CCCGCCGTCT ACGTCGAAGG AATGGACCCG
CTGAGTTTCG ACGGTTTCGG GACTGTGGTG CAGGACCCGG ATCAGGCCGA TGTTGCCGTG
ATCCGGCTGC ACTCCCCCTG GGACCACCGC GACGACCTGT TCCTGGAACA GCACTTCCAC
GCCGGAAGCC TGGACTTCCC GCCCGGGCTG GTCTCCCGGC TCCGGACCCT GGCCGCGAAG
GTTCCGCTGA TCATCGACGT GCGGTTGGAC CGTCCAGCCA TCCTCACCCC GTTGGCGGGG
TTCGCTGCGG CCCTGGTGGG CACCTTCGGC GTCTCGGACA CCGCCCTGCT CGATGCCCTG
TTCGGACGCA TTGAGCCGCA GGGCAGCCTT CCCTTCGATA TTCCCAGGTC CATGGACGCC
GTACGTGGTT CACGCTCGGA TGTCCCCGGA GACACCGCCG ATCCCCTCTT CCGGTTCGGG
CACGGCCTGC GGCTACCCGC GGCGCCCAAC GCCGGAACCG CCGCTGCTAT CCAGTCAGGA
CTGAGCTCAT GA
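
A GC percentage like the one reported for this gene can be computed from the listing above once the grouping spaces and line breaks are removed. The sketch below is one minimal way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not say how its own figure was computed or rounded, so the result may differ slightly from the listed value.

```python
def clean_sequence(text):
    """Drop the spaces and line breaks that group bases into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Demonstration on the first line of the gene sequence only;
# paste the full listing into `sample` to evaluate the whole gene.
sample = "TTGACTTCAC AGACCCGCAC CAGCCCGCGC CCCGGAAACC CGCGCCCCGG AAACCCCCGT"
seq = clean_sequence(sample)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```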
 
Protein sequence
MTSQTRTSPR PGNPRPGNPR PGNSGNSTQT AGTGPATSVA ADGTRYRDLN GNGIMDPFEN 
PGLSPHERAA DLVARLSLEE KAGLMFHTVI ETGPGGSLLE TPGNISKSPT STVILGKFMN
HFNIHALGTA REAAMWSNAL QQLAAQTPHG IPVTISTDPR HAFIENSGVS FTAAHFSQWP
EPIGLAAVGS AELIRRFAEI ARTEYTAVGI RAALHPTVDL ATEPRWCRQA GTFGQDSELS
SKYVVEYLQG FQGDELGPDS VACTTKHFPG GGPQRDGEDA HFPYGREQVY PGGRFDEHLA
PFRAAIAEKT SAIMPYYGMP IGVELDGEPV EEVGFGYNRQ IITNLLRDKL GYDGVVLSDW
ELVNDNIVGE QVLPARAWGV EELTAPERML KILNAGVDQF GGEECTELLL GLVRDGLVSE
ERIDESARRL LLVKFQLGLF DNPFVDEDAA AEIVGNPEFR REGHRAQATS VTVLANGTHD
GGTALPLTGS PAVYVEGMDP LSFDGFGTVV QDPDQADVAV IRLHSPWDHR DDLFLEQHFH
AGSLDFPPGL VSRLRTLAAK VPLIIDVRLD RPAILTPLAG FAAALVGTFG VSDTALLDAL
FGRIEPQGSL PFDIPRSMDA VRGSRSDVPG DTADPLFRFG HGLRLPAAPN AGTAAAIQSG
LSS
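
The protein sequence can be reproduced from the gene sequence with translation table 11, which is why the gene starts with TTG while the protein starts with M. The following is a minimal standard-library sketch, not the database's own pipeline. It assumes table 11 shares the standard codon assignments and that an initiator codon in the first position is read as methionine. The names `translate` and `START_CODONS` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Initiator codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna, initiator=True):
    """Translate a DNA string, stopping at the first stop codon.

    If `initiator` is true and the first codon is a table 11 start codon,
    it is translated as M regardless of its usual amino acid.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if i == 0 and initiator and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene: TTG is read as the initiator M.
print(translate("TTGACTTCAC AGACCCGCAC CAGCCCGCGC"))  # MTSQTRTSPR
# Last 12 bases of the gene, translated as internal codons up to TGA.
print(translate("CTGAGCTCAT GA", initiator=False))    # LSS
```

Passing the full gene sequence to `translate` should yield a string that can be compared against the protein listing after removing its grouping spaces.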