Gene Arth_1891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1891 
Symbol 
ID4445580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2126563 
End bp2128014 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content61% 
IMG OID639689703 
Productglycoside hydrolase family protein 
Protein accessionYP_831375 
Protein GI116670442 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.001415 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACTGC CTAAAGGATT CCGATGGGGC GGTGCCATCG CCGCAAATCA GGTCGAGGGC 
GCCTGGCGCG AGGGAGGCCG CGGTGCCGCC GTCTCCGACG TCGCAACCTA CAAGCCCGAC
GCCGACCCCA AGGACTACGC GATCCACCAC CAGATCACCG TGGAGAGCAT CAACGCAGCC
TTAGCGGATG ACGATGAACG GCTGTTTCCG AAGCGACGCG GCATCGACTT TTATCACCGG
TACCCGGGGG ACCTCGCATT ATTCGCCGAG ATGGGCTTTA CAACCTTGCG AGTCTCGATC
TCCTGGACCC GGCTCTATCC CACCGGAGAG GAGCTCGAAC CCCAGGCCGA CGGGGTCGCC
TTCTACAAGG CGTTGTTCAC TGAGATGCGG CGCCTGAACA TCGAACCCTT GGTGACCCTC
TCGCACTACG ACCCACCGAT AGCCCTCGCG CTCAAGCACA ACGGCTGGGT CGCGCGCCGC
ACAATCGCGT TATTCGAGCG CTTCGCCCGC ACCTGCTTCA GTGAGTTCGG CGACCTGGTG
AACATGTGGC TTACCTTCAA CGAGATCGAC GGCATCATCC GTCACCCATT CACCTCCGGT
GGCATCATCG ACGAAACCGT TGAGGGCAGC CTCGAGCAAG CCTGCTACAG CGCACTGCAC
CACCAGTTCG TGGCTGCAGC ATCGGTCACC AAAATGCTTC GCGAGATCTC ACCAGGGGCG
CAGATGGGTT GCATGCTCAC CATGCTCATG ACATACCCGA ATACCTGTCG TCCCGAAGAC
GTCGCCGCCA CGCAAGCGAA AGAGAGGCTG CTCTATCTAT GCACTGACGT GCAGGCCGGA
GGCGGCTACC CGCGGCTAGC CCTGCGGGCG CTCGAGCTTC GCGGCGTAAC CATCCCCTTC
CTTGACGGCG ACACCAAACT GCTCGCCGAA AATCCAGTCG ACTTCATCTC GTTCAGCTAC
TACAACTCGA TGACCGAATC GGTGCGCCCG GATGCCGAGC GCACACCAGG GAACACCGTG
CTCGGGGTGA AGAACCCGTT CCTCGATTCG AGCGAATGGG GATGGCAGAT CGACCCGGTC
GGCCTCCGGA TCGCGCTGAT CGACCTCTAC GACCGTTACG GCAAACCGTT GTTCATCGTG
GAGAACGGCC TGGGTATGCG CGACGAGCTG ACCGCCGAAG GCAAAATCCA CGACCCCTAC
CGTATCGGCT ACTTCCGCGC GCATTTCCAG CAGATGATCC AGGCCGTCGA TGAGGGCGTG
GAACTCATGG GCTACGTCAG CTGGGCGCCC ATCGACCTCA TCAGTTCGTC AAGCTCACAA
ATCTCGAAGA GATACGGCTT CATCTACGTC GATCAAGACG ACCTCGGCCA AGGAAGCGGA
GACCGTTACC GGAAGGACTC CTTCTTCTGG TACCAGAAGG TCATCGCGTC GAACGGCGCC
GACCTGGAAT GA
 
Protein sequence
MSLPKGFRWG GAIAANQVEG AWREGGRGAA VSDVATYKPD ADPKDYAIHH QITVESINAA 
LADDDERLFP KRRGIDFYHR YPGDLALFAE MGFTTLRVSI SWTRLYPTGE ELEPQADGVA
FYKALFTEMR RLNIEPLVTL SHYDPPIALA LKHNGWVARR TIALFERFAR TCFSEFGDLV
NMWLTFNEID GIIRHPFTSG GIIDETVEGS LEQACYSALH HQFVAAASVT KMLREISPGA
QMGCMLTMLM TYPNTCRPED VAATQAKERL LYLCTDVQAG GGYPRLALRA LELRGVTIPF
LDGDTKLLAE NPVDFISFSY YNSMTESVRP DAERTPGNTV LGVKNPFLDS SEWGWQIDPV
GLRIALIDLY DRYGKPLFIV ENGLGMRDEL TAEGKIHDPY RIGYFRAHFQ QMIQAVDEGV
ELMGYVSWAP IDLISSSSSQ ISKRYGFIYV DQDDLGQGSG DRYRKDSFFW YQKVIASNGA
DLE