Gene Arth_3889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3889 
Symbol 
ID4445090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4377171 
End bp4379852 
Gene Length2682 bp 
Protein Length893 aa 
Translation table11 
GC content68% 
IMG OID639691714 
Productglycoside hydrolase 15-related protein 
Protein accessionYP_833364 
Protein GI116672431 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1877] Trehalose-6-phosphatase
[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID[TIGR00685] trehalose-phosphatase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCTCG CCGGACTTTT CTCATCATTG GGGATTATCC GAAGGACGTC GCCTGTGCCT 
ACTCCGCTCA ACCAGTCCGT GGCCGATTCT GCCCTCCTAA CGCGCTCCCT GCCGCTCGGC
CTGCTCCGCG CCTTTGTTCA GTCCGATGCA GCCGACGACG GCCTCACGCC AAGCCTGCTC
GCTGAACTCA AGATCCTCGC ACGCACTCCC GGGCTGCTGG TGGCCTGCAA CTACGGCGGC
ACGCTCTGCG ACGCGGAGGG GATCTCCACC GAAACCCTGC CGCTGGGGAG CGCCGCGATC
GCTCTCCGGG CCCTTGCCGC ACTGCCCAAC ACCCACGCGG CCGTCATCTC CGGCAGGTCG
CTGCGGGACC TGGCGGCAGT TTCCAGGCTT CCGGCGGAGG TGCACCTCGT GGGCTCCCAC
GGCGCCGAAT TCGACATGGG CTTCGCCCAC GGCCTGTCCC TGGCCACCGA ATCGGTGCTG
CAGCAGGCCA GCCAGGCCCT CGTTGAAACG ATCGGCGCCT ACAAGGGGAT CAGCATCGAG
CGCAAGCCCG TTGCGGTGTC CGTACACACC CGGCCGGCCT CGACCGCGAT TGTTGCCAAG
GTGCTGGAAA AGGCCGAAGA GGTGGCCCGC GCCCACGGCC TGTTCTACAT CGTGGACGGC
TCCGTGCTGG ACCTTTCCGT GGTGGAGCCG TCCAAGGCCG ATGCGCTGGA GCACCTGCGC
GCCCGCTTGG GCGTGAGTGC CGCCCTCTAT GCCGGCGACG CTTCCAGCGA CGAACTGGCA
ATGGCCACCC TGCGCGGCCC GGATATGGGC ATCAAGGTGG GGGAGGGCCC CACTGCGGCC
ACCCACCGCC TCCGGGACCC CGAGTCCTTC GCGCGGGTCC TGGCCATCCT CTTCGAACTG
CGGCGGGCCT GGCTTTTCGG GGAGGACGCC GTTGGCCTGG AGCGCCACTC GATGATCGGC
AACGGCTCGT CCACTGCGCT GATCACGCCC GAAGCCAAGG TCTGCTGGAT GAGCCACCCG
CTGCCGGACT CGGGGTCCCT GTTCGCCCAT ATCCTGGGCG GCGACGCCGC GGGCCACTTC
TCGGTGGAAC CGGTCAAGGC ATCCCAGGTC TTGGGCCAGC GCTATGTGGA CAGCACCATG
ATCGTGGAAA CCCGCTGGGC TGACGTCACC GTGACCGACT ATCTTGAGCC GGCCCCGGAC
GGGATCACGA GCCTGGTCCG CGTGCTGTCC GGCAGCGGCG CCGCCCGGAT CGTCTTCGCG
CCCCGGCCCG ACTACGCCAA CGCCCCGTTC AGCATGGAGG CGCGGGGCGA GGAACTGCAC
GTGGTGGGCA CGTCCGACCC GATCATCCTG CTGGCTCCCG GCGTCAGCTT TTCCATCACT
TCGGACGGCC GCCATGCCAC TGCCACCGCG GACGTCAACC TGCGGAATGG CCCCGTGGTG
CTCAACCTGC GCTGCGGCGA CACCGAGCCA ACTCACGCCG GGGTGGGCGG CGAAACCGAG
CGCCGCGCCG CCGTCGCCCA TCACTCCCGG CGGTGGGTCC AGGACCTGGA CCTGCCGGGC
GTCAAGCCGT CGCTGGTGCG CCGTTCGGCG CTGGTGCTGC GCGCACTGGT GCACGAACCC
ACCGGCGCCG TCCTGGCCGC CCCCACCACC TCGCTGCCGG AAGGAATCGG CGGCACCCGG
AACTGGGACT ACCGCTACTG CTGGTTGCGG GACGGGTCCA TGACCGTCAA TGCGCTCGTG
GACCTGGGTT CCACGACGGA AGCGCATGGG TTCCTGCGCT GGCTGGGCCG GATCCTGGAC
AACGCCCCCG GACCGGAATG GCTGCACCCG CTGTACTCGG TCACCGGGGC GCCGCTGTCC
ACTGAGGCCA TCATCGAAAG TCTGCCCGGT TACGCGGGAT CACGCCCCGT CAGGATCGGC
AACGCCGCGG ACCACCAGGT GCAGCTGGAT GTGTTCGGGC CCATCGCGGA ACTGATCTGC
GCGGTGAGTC AGCGCGAGGG CACTTTGGAG GATTCGCACT GGGAGCTGAT GATCCAAATG
GCCTCCGCCG TGATGGCCCG CTGGCACGAG GCCGATCACG GGATCTGGGA AGCGCGCCGC
GCACCCCGGC ACCACGTCTA CACCAAGGTG ATGTGCTGGG TAACCCTGGA CCGGGCGCTG
CGCACGGCGG CCAGGCACGG CAGGGCGCCG GAACCCGAGT GGGCACCCAC CGCCGCAACC
ATCCGCGAGG AAGTCCTCCG CGAAGGCTGG GATGACGGTG CGGCGTCCTA CACCGTTGCG
TACGACAGCC CCGACCTCGA CGCCGCGGTG CTGCACATCG GCCTGTCCGG CCTGCTGGAT
GTGAACGACC AGCGGTTCCT GGACACCGTC ACCGCTGTGG AGCGGGAGCT TCGGGTGGGA
CCCACCGTCT TTCGGTACAG GTACGACGAC GGGCTGCCGG GTTTGGAGGG CGGCTTCCAC
ATCTGCACCA CGTGGCTCAT CGAGGCCTAT GTGGCGGTGG GCCGGATCGA GGAGGCGTGG
GACCTGTTCG ACCAGCTGGT AAACCTGTTC GGCCCTACCG GACTGCTGCC CGAAGAATAC
GATCCCGGGA CCGAAACCCA TCTGGGCAAC CACCCGCAGG CTTACTCCCA TCTGGGCTTC
ATCCGCTGCG CCCGGATCCT GGACCAGCAC CAGAAGAACT AA
 
Protein sequence
MSLAGLFSSL GIIRRTSPVP TPLNQSVADS ALLTRSLPLG LLRAFVQSDA ADDGLTPSLL 
AELKILARTP GLLVACNYGG TLCDAEGIST ETLPLGSAAI ALRALAALPN THAAVISGRS
LRDLAAVSRL PAEVHLVGSH GAEFDMGFAH GLSLATESVL QQASQALVET IGAYKGISIE
RKPVAVSVHT RPASTAIVAK VLEKAEEVAR AHGLFYIVDG SVLDLSVVEP SKADALEHLR
ARLGVSAALY AGDASSDELA MATLRGPDMG IKVGEGPTAA THRLRDPESF ARVLAILFEL
RRAWLFGEDA VGLERHSMIG NGSSTALITP EAKVCWMSHP LPDSGSLFAH ILGGDAAGHF
SVEPVKASQV LGQRYVDSTM IVETRWADVT VTDYLEPAPD GITSLVRVLS GSGAARIVFA
PRPDYANAPF SMEARGEELH VVGTSDPIIL LAPGVSFSIT SDGRHATATA DVNLRNGPVV
LNLRCGDTEP THAGVGGETE RRAAVAHHSR RWVQDLDLPG VKPSLVRRSA LVLRALVHEP
TGAVLAAPTT SLPEGIGGTR NWDYRYCWLR DGSMTVNALV DLGSTTEAHG FLRWLGRILD
NAPGPEWLHP LYSVTGAPLS TEAIIESLPG YAGSRPVRIG NAADHQVQLD VFGPIAELIC
AVSQREGTLE DSHWELMIQM ASAVMARWHE ADHGIWEARR APRHHVYTKV MCWVTLDRAL
RTAARHGRAP EPEWAPTAAT IREEVLREGW DDGAASYTVA YDSPDLDAAV LHIGLSGLLD
VNDQRFLDTV TAVERELRVG PTVFRYRYDD GLPGLEGGFH ICTTWLIEAY VAVGRIEEAW
DLFDQLVNLF GPTGLLPEEY DPGTETHLGN HPQAYSHLGF IRCARILDQH QKN