Gene Arth_3388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3388 
Symbol 
ID4444117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3810231 
End bp3812438 
Gene Length2208 bp 
Protein Length735 aa 
Translation table11 
GC content68% 
IMG OID639691211 
Productglycoside hydrolase, clan GH-D 
Protein accessionYP_832863 
Protein GI116671930 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCCCC TGCACCTCCG CTCCGCCGGC ACCAGCCTGG TGATCAGCTT CGACAGCGGG 
GAGGCCGAGG TCATTCACTG GGGCGCCGAT CTGGGCGCTT CACTCCCCGA TCTGGCCATC
CTCGGCGAAC CGATCCCGCC CTCCGCCATC GACGCCTCCG TCCCCGCCGG GCTGCTGCCG
CAGGCATCCT CCAGCTGGCG TGGACGCCCG GCCCTCCGGG GGCACCGGAT CGCCGACGGC
GTGCCCGGCT ACGACTTTTC CGTCCGCCTG CGCGTCACGG ACGTCAAGAC CGCAGGAAGT
TCAGCTGTGA TCGTCCAGTC TGATCCCGAT GCCGGAATCT CTGTTGAATC CACGCTGGAA
CTGCACGCCG GCGGGCTGCT GGAAATGCGC CACACCGTCA CCAACACCGG CACTTCGCCT
TTTCAGCTCG ACGAACTGGC CACGGTGCTG CCGGTGGCTC CCGACGCCGT CGAACTCCTT
GACCTGACCG GACGCTGGTG CCGTGAACGC CACCCTCAGC GCCGCGCCAT CCAGCAGGGC
ACCTGGGTGC GGACCGGCCG GCACGGCCGG ACCGGCCACG ACTCCTCCCT GCTGCTGGCC
GCCGGCACGG CAGGCTTCGG CAACCGCCAC GGCAAGGTCT GGGCCACCCA CCTTGCCTGG
AGCGGAAACC ATGAGCAGTT CGCGGACAGC ATCGGGGACG GACGGACCGT CATCGGTGGT
TCCGAGCTGC TGGGTCCGGC CGAAGTGGTC CTCCAGCCGA ACGGCAGCTA CACCACCCCC
GCTCTCTTCG CGGCCTACTC GGACCGCGGC CTGGACGGTA TCAGCGAAGC GTTCTACAGC
TGGTTCAGGA ACCGGCCGCA CCATGTGCTG CCTTCGGCGT CAGCCGTCTC AGGAGCAGCT
CATGCGGGCA CCGGCAAGGC CCGGCCTGTA GTGCTAAACG TCTGGGAAGC TGTCTACTTC
AACCACGATC TGGGTGTATT GGTCGAACTT GCCGATTCCG CGGCGGACCT GGGCGTGGAG
CGCTTTGTCC TCGACGACGG GTGGTTCCGC GGCCGCCGGC ACGACCAGGC AGGCCTGGGC
GACTGGTACG TGGACGAGGG CCTCTGGCCG GACGGGCTCA CACCCCTGAT CGACGCCGTC
ACATCGCGCG GCATGGAATT CGGCCTCTGG GTGGAGCCCG AAATGATCAA CCTGGACTCC
GACACCGCGC GCGCCCACCC GGACTGGATC GTCGGGCCGG CCGCACGGTC CCACAAGGAC
GGCGGCCGGC TGCCGTTGAC CTGGCGCCAC CAGCACGTCA TCGACCTGGT CAATCCCGAG
GCCTGGCAGT ACGTTTTCGA CCGCATTGAC GCCCTGTTGC GCGAAAACAA CATCAGCTAC
CTGAAGTGGG ACCAGAACCG GGACCTCACC GAGCACGGCC ACGCCGGGCG CGCCTCCGTC
CACGAACAGA CCCTGGCCGC CTACCGCCTC TTCGATGAGC TCAGGAAAGC CCATCCGGGC
CTCGAAATCG AGAGCTGCTC TTCCGGCGGG GCACGCGTGG ACCTGGGCAT CCTGGAACGC
ACGGACCGGA TCTGGGCTTC GGACTGCAAC GATGCCCTGG AACGCCAGAC CATCCAGCGC
TGGACCGGGC TGGTGGTGCC GCCGGAACTG GTCGGAGGAC ACATCGGCCC CACTACGTCA
CACACCACGG CCCGCACGCA CGACGTTTCG TTCCGCGCCA TCACGGCCCT GTTCGGACAC
TTCGGCCTCG AATGGGACGT CCGCCAGGTT CACGGCGCGG AGCGCGAAGA ACTCAAGCGG
TTCATCGGGC TCTACAAGGA GCACCGCGGC CTGATCCACT CGGGCCGGAT GGTCCGGGCG
GATGTTGCCG ACGATTCGCT GATGCTGCAC GGCGTCGTTT CCCACGGCAG CCCAGCAACC
GGGGACACGG CGGCACTGTT CGCGCTGGTC AGCACCAGGA CGTCGCCCGC GGAGCGTCCG
GGCCGCATCG CCATTCCGGG ACTGGACCAG GACCGCAGCT ACCGCGTGGA GGCCATCTTC
CCGACGCCCG GCGATGCCGA CTACGCGCAC AACTACACCC AGGCGCAGCC CCCCGCATGG
CTGACCGCGG GTGCAGAAGC CAGCGGCCGG TTCCTGTCCG AGGTGGGCCT GCCCATGCCC
GTCCTCAACC CGGAGCACGC ACTGCTGCTC AGCTTCACTG CCGTGTAG
 
Protein sequence
MDPLHLRSAG TSLVISFDSG EAEVIHWGAD LGASLPDLAI LGEPIPPSAI DASVPAGLLP 
QASSSWRGRP ALRGHRIADG VPGYDFSVRL RVTDVKTAGS SAVIVQSDPD AGISVESTLE
LHAGGLLEMR HTVTNTGTSP FQLDELATVL PVAPDAVELL DLTGRWCRER HPQRRAIQQG
TWVRTGRHGR TGHDSSLLLA AGTAGFGNRH GKVWATHLAW SGNHEQFADS IGDGRTVIGG
SELLGPAEVV LQPNGSYTTP ALFAAYSDRG LDGISEAFYS WFRNRPHHVL PSASAVSGAA
HAGTGKARPV VLNVWEAVYF NHDLGVLVEL ADSAADLGVE RFVLDDGWFR GRRHDQAGLG
DWYVDEGLWP DGLTPLIDAV TSRGMEFGLW VEPEMINLDS DTARAHPDWI VGPAARSHKD
GGRLPLTWRH QHVIDLVNPE AWQYVFDRID ALLRENNISY LKWDQNRDLT EHGHAGRASV
HEQTLAAYRL FDELRKAHPG LEIESCSSGG ARVDLGILER TDRIWASDCN DALERQTIQR
WTGLVVPPEL VGGHIGPTTS HTTARTHDVS FRAITALFGH FGLEWDVRQV HGAEREELKR
FIGLYKEHRG LIHSGRMVRA DVADDSLMLH GVVSHGSPAT GDTAALFALV STRTSPAERP
GRIAIPGLDQ DRSYRVEAIF PTPGDADYAH NYTQAQPPAW LTAGAEASGR FLSEVGLPMP
VLNPEHALLL SFTAV