Gene Information |
Locus tag | Arth_3889 |
Symbol | |
ID | 4445090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4377171 |
End bp | 4379852 |
Gene Length | 2682 bp |
Protein Length | 893 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691714 |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_833364 |
Protein GI | 116672431 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1877] Trehalose-6-phosphatase; [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | [TIGR00685] trehalose-phosphatase |
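The coordinate and length fields in this table can be cross-checked against each other. The sketch below is only an illustration, assuming that Start bp and End bp are 1-based inclusive positions (the record does not state this) and that the listed gene sequence includes the terminal stop codon (it ends in TAA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4377171
END_BP = 4379852
GENE_LENGTH_BP = 2682
PROTEIN_LENGTH_AA = 893


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare derived values with the values reported in the record.
coords_consistent = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_consistent = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print(coords_consistent, protein_consistent)
```

Under these assumptions, 4379852 - 4377171 + 1 gives 2682 bp, and 2682 / 3 - 1 gives 893 aa, which agrees with the reported Gene Length and Protein Length.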
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTCCCTCG CCGGACTTTT CTCATCATTG GGGATTATCC GAAGGACGTC GCCTGTGCCT ACTCCGCTCA ACCAGTCCGT GGCCGATTCT GCCCTCCTAA CGCGCTCCCT GCCGCTCGGC CTGCTCCGCG CCTTTGTTCA GTCCGATGCA GCCGACGACG GCCTCACGCC AAGCCTGCTC GCTGAACTCA AGATCCTCGC ACGCACTCCC GGGCTGCTGG TGGCCTGCAA CTACGGCGGC ACGCTCTGCG ACGCGGAGGG GATCTCCACC GAAACCCTGC CGCTGGGGAG CGCCGCGATC GCTCTCCGGG CCCTTGCCGC ACTGCCCAAC ACCCACGCGG CCGTCATCTC CGGCAGGTCG CTGCGGGACC TGGCGGCAGT TTCCAGGCTT CCGGCGGAGG TGCACCTCGT GGGCTCCCAC GGCGCCGAAT TCGACATGGG CTTCGCCCAC GGCCTGTCCC TGGCCACCGA ATCGGTGCTG CAGCAGGCCA GCCAGGCCCT CGTTGAAACG ATCGGCGCCT ACAAGGGGAT CAGCATCGAG CGCAAGCCCG TTGCGGTGTC CGTACACACC CGGCCGGCCT CGACCGCGAT TGTTGCCAAG GTGCTGGAAA AGGCCGAAGA GGTGGCCCGC GCCCACGGCC TGTTCTACAT CGTGGACGGC TCCGTGCTGG ACCTTTCCGT GGTGGAGCCG TCCAAGGCCG ATGCGCTGGA GCACCTGCGC GCCCGCTTGG GCGTGAGTGC CGCCCTCTAT GCCGGCGACG CTTCCAGCGA CGAACTGGCA ATGGCCACCC TGCGCGGCCC GGATATGGGC ATCAAGGTGG GGGAGGGCCC CACTGCGGCC ACCCACCGCC TCCGGGACCC CGAGTCCTTC GCGCGGGTCC TGGCCATCCT CTTCGAACTG CGGCGGGCCT GGCTTTTCGG GGAGGACGCC GTTGGCCTGG AGCGCCACTC GATGATCGGC AACGGCTCGT CCACTGCGCT GATCACGCCC GAAGCCAAGG TCTGCTGGAT GAGCCACCCG CTGCCGGACT CGGGGTCCCT GTTCGCCCAT ATCCTGGGCG GCGACGCCGC GGGCCACTTC TCGGTGGAAC CGGTCAAGGC ATCCCAGGTC TTGGGCCAGC GCTATGTGGA CAGCACCATG ATCGTGGAAA CCCGCTGGGC TGACGTCACC GTGACCGACT ATCTTGAGCC GGCCCCGGAC GGGATCACGA GCCTGGTCCG CGTGCTGTCC GGCAGCGGCG CCGCCCGGAT CGTCTTCGCG CCCCGGCCCG ACTACGCCAA CGCCCCGTTC AGCATGGAGG CGCGGGGCGA GGAACTGCAC GTGGTGGGCA CGTCCGACCC GATCATCCTG CTGGCTCCCG GCGTCAGCTT TTCCATCACT TCGGACGGCC GCCATGCCAC TGCCACCGCG GACGTCAACC TGCGGAATGG CCCCGTGGTG CTCAACCTGC GCTGCGGCGA CACCGAGCCA ACTCACGCCG GGGTGGGCGG CGAAACCGAG CGCCGCGCCG CCGTCGCCCA TCACTCCCGG CGGTGGGTCC AGGACCTGGA CCTGCCGGGC GTCAAGCCGT CGCTGGTGCG CCGTTCGGCG CTGGTGCTGC GCGCACTGGT GCACGAACCC ACCGGCGCCG TCCTGGCCGC CCCCACCACC TCGCTGCCGG AAGGAATCGG CGGCACCCGG AACTGGGACT ACCGCTACTG CTGGTTGCGG GACGGGTCCA TGACCGTCAA TGCGCTCGTG GACCTGGGTT CCACGACGGA AGCGCATGGG TTCCTGCGCT GGCTGGGCCG GATCCTGGAC 
AACGCCCCCG GACCGGAATG GCTGCACCCG CTGTACTCGG TCACCGGGGC GCCGCTGTCC ACTGAGGCCA TCATCGAAAG TCTGCCCGGT TACGCGGGAT CACGCCCCGT CAGGATCGGC AACGCCGCGG ACCACCAGGT GCAGCTGGAT GTGTTCGGGC CCATCGCGGA ACTGATCTGC GCGGTGAGTC AGCGCGAGGG CACTTTGGAG GATTCGCACT GGGAGCTGAT GATCCAAATG GCCTCCGCCG TGATGGCCCG CTGGCACGAG GCCGATCACG GGATCTGGGA AGCGCGCCGC GCACCCCGGC ACCACGTCTA CACCAAGGTG ATGTGCTGGG TAACCCTGGA CCGGGCGCTG CGCACGGCGG CCAGGCACGG CAGGGCGCCG GAACCCGAGT GGGCACCCAC CGCCGCAACC ATCCGCGAGG AAGTCCTCCG CGAAGGCTGG GATGACGGTG CGGCGTCCTA CACCGTTGCG TACGACAGCC CCGACCTCGA CGCCGCGGTG CTGCACATCG GCCTGTCCGG CCTGCTGGAT GTGAACGACC AGCGGTTCCT GGACACCGTC ACCGCTGTGG AGCGGGAGCT TCGGGTGGGA CCCACCGTCT TTCGGTACAG GTACGACGAC GGGCTGCCGG GTTTGGAGGG CGGCTTCCAC ATCTGCACCA CGTGGCTCAT CGAGGCCTAT GTGGCGGTGG GCCGGATCGA GGAGGCGTGG GACCTGTTCG ACCAGCTGGT AAACCTGTTC GGCCCTACCG GACTGCTGCC CGAAGAATAC GATCCCGGGA CCGAAACCCA TCTGGGCAAC CACCCGCAGG CTTACTCCCA TCTGGGCTTC ATCCGCTGCG CCCGGATCCT GGACCAGCAC CAGAAGAACT AA
Protein sequence | MSLAGLFSSL GIIRRTSPVP TPLNQSVADS ALLTRSLPLG LLRAFVQSDA ADDGLTPSLL AELKILARTP GLLVACNYGG TLCDAEGIST ETLPLGSAAI ALRALAALPN THAAVISGRS LRDLAAVSRL PAEVHLVGSH GAEFDMGFAH GLSLATESVL QQASQALVET IGAYKGISIE RKPVAVSVHT RPASTAIVAK VLEKAEEVAR AHGLFYIVDG SVLDLSVVEP SKADALEHLR ARLGVSAALY AGDASSDELA MATLRGPDMG IKVGEGPTAA THRLRDPESF ARVLAILFEL RRAWLFGEDA VGLERHSMIG NGSSTALITP EAKVCWMSHP LPDSGSLFAH ILGGDAAGHF SVEPVKASQV LGQRYVDSTM IVETRWADVT VTDYLEPAPD GITSLVRVLS GSGAARIVFA PRPDYANAPF SMEARGEELH VVGTSDPIIL LAPGVSFSIT SDGRHATATA DVNLRNGPVV LNLRCGDTEP THAGVGGETE RRAAVAHHSR RWVQDLDLPG VKPSLVRRSA LVLRALVHEP TGAVLAAPTT SLPEGIGGTR NWDYRYCWLR DGSMTVNALV DLGSTTEAHG FLRWLGRILD NAPGPEWLHP LYSVTGAPLS TEAIIESLPG YAGSRPVRIG NAADHQVQLD VFGPIAELIC AVSQREGTLE DSHWELMIQM ASAVMARWHE ADHGIWEARR APRHHVYTKV MCWVTLDRAL RTAARHGRAP EPEWAPTAAT IREEVLREGW DDGAASYTVA YDSPDLDAAV LHIGLSGLLD VNDQRFLDTV TAVERELRVG PTVFRYRYDD GLPGLEGGFH ICTTWLIEAY VAVGRIEEAW DLFDQLVNLF GPTGLLPEEY DPGTETHLGN HPQAYSHLGF IRCARILDQH QKN