Gene Information |
Locus tag | Arth_1828 |
Symbol | |
ID | 4445657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 2047213 |
End bp | 2049030 |
Gene Length | 1818 bp |
Protein Length | 605 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639689646 |
Product | glycoside hydrolase 15-related protein |
Protein accession | YP_831318 |
Protein GI | 116670385 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
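The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 2047213
end_bp = 2049030
gene_length_bp = 1818
protein_length_aa = 605

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # span in bp, frame remainder, residue count
```

Under these assumptions the span matches the listed gene length, is a whole number of codons, and yields the listed protein length.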
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0799869 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGCAT TAATAGAGGA TTATGCGCTG CTCTCCGATC TTCACACCGG GCCCCTTGTT TCGCGGCGGG GCAGCGTTGA CTGGCTGTGT TTCCCGCGGT TCGACTCGCC CTCGGTCTTT GCTGCCCTAC TCGGGGGTGA GGAACATGGC CGATGGCTGT TGGCGCCGAG CGCGCCGGAG GCCGTGGTCA TTGACCGCCA CTATGTCGAC TCGACGTTTG TACTCCAAAC AACATGGCAG ACCGATGCCG GCGAGGTCTT AGTGACGGAC TTCATGCCGG TGGGTGACAG CCGTTCGCGT CTTGTCCGTC GCATGACAGG ACTCAGCGGG ACGGTCCTCA TGCGCCACGA GATTAGAATA CGACCCCAAT ACGCCACTGT GCTGCCTTGG GTGAGTCGGG TCCGTGATAG CGCGCCGGGT CAGGCTGCGG AGATTCTCCT GGCCATGTCG GGTCCGGACG CCCTGGCCCT CCGGGGAGAA GATCTTCCGA TTGCGGAAGG TCACCGACAT GCCGGGGAAT TTTTGGTTGC ACAGGGGAAG AACGTGGACT TCGAACTGAC ATGGTTCCCC TCGCACCAGG ATGTGCCATC GGCCGTCAAT GTCGATGCCG CCCTTGACCT GGCTGTGGCC TATTGGACCA CCTGGGCAGG AAATTGCCGC GATGACGGCA AATACGGCAG TGCAGTGAAG CGTTCGCTGC TGGTGCTTCG TGCGCTCACC CACTACGAGA CAGGCGGCAT TGTCGCTGCG CCCACCACAT CCCTACCGGA GGATTTCGGT GGGTCACGCA ATTGGGACTA CCGCTACTGT TGGCTGCGCG ATGCCTCTTT GACGTTGGAA GCCATGTTGA CACATGGCTA TGAATCCGAG GCGCTGAAGT GGCGCAACTG GCTTTTACGT GCACTCGCGG GCGATCCGGA GGACCTGCAG ATTATGTACG GCGTGGGGGG TGAGCGGGAC CTGACGGAAA AGGAACTCCC CCACCTTCCC GGATACCAGA ATTCCAGACC GGTGCGAATC GGTAACGCTG CTGTGTCCCA GTACCAGGCT GACGTGGTCG GTGAAGTGAT GGTGGCACTT GAAAGGCTTC GGCTGGCCGG AGGCAAGGAA GACCACTTCT CCTGGGCACT CCAGCGTGCG CTGCTCGGAT CCGTGGAAAA TCACCTCGAG GACAAAGACT TCGGCCTGTG GGAAATGCGC GGCGATGCCC AGTACTTCAC CCACTCCCGG GTGATGATGT GGGCCGCTTT TGACAGCGGA GTGCGGGCTG TTCGCGATCA CGGCCTACCT GGACCGGCAG AGCATTGGGA GCAACTCCGT GAGGGATTGG CCACGGAAAT CATGGATCTC GGATTCAACC GGGACCTCAA CTCCTTCACC CAAACCTATG GCGGTCGGCA GACGGACGCT GCTCTGCTGG CCTTGCCGCA GGTTGGCTTC CTTGCCTACG ACGATGAGCG CATGCTCGGT ACGGTTGACC AGCTGGAAAA GGAGCTGCTC ACTACCGAGG GCCTGCTGAT GCGCTACCGA ACCGAAACAG GAGTTGACGG ATTAGAACCG GGAGAACATG CGTTCCTCGC ATGCTCCTTC TGGCTAGTGG AACAGTACGC GCGGTCAGGC CGCTGGGCCG ACGCCAGGAA CCTGATGGAC GTGCTGGCCG GGCTCGCCAA CGAGCTGGGC CTGCTCAGCG AAGAATATTC AATGAAGGAA AAGCGGATGG CGGGAAACTT TCCACAAGCT TTCTCCCATC TGACCCTAGT GCGGGCGGCA GACGCCATGC ACGGCGTGGA CCGGCTCAGC 
CTGCATCCCA AACATTGA
|
Protein sequence | MAALIEDYAL LSDLHTGPLV SRRGSVDWLC FPRFDSPSVF AALLGGEEHG RWLLAPSAPE AVVIDRHYVD STFVLQTTWQ TDAGEVLVTD FMPVGDSRSR LVRRMTGLSG TVLMRHEIRI RPQYATVLPW VSRVRDSAPG QAAEILLAMS GPDALALRGE DLPIAEGHRH AGEFLVAQGK NVDFELTWFP SHQDVPSAVN VDAALDLAVA YWTTWAGNCR DDGKYGSAVK RSLLVLRALT HYETGGIVAA PTTSLPEDFG GSRNWDYRYC WLRDASLTLE AMLTHGYESE ALKWRNWLLR ALAGDPEDLQ IMYGVGGERD LTEKELPHLP GYQNSRPVRI GNAAVSQYQA DVVGEVMVAL ERLRLAGGKE DHFSWALQRA LLGSVENHLE DKDFGLWEMR GDAQYFTHSR VMMWAAFDSG VRAVRDHGLP GPAEHWEQLR EGLATEIMDL GFNRDLNSFT QTYGGRQTDA ALLALPQVGF LAYDDERMLG TVDQLEKELL TTEGLLMRYR TETGVDGLEP GEHAFLACSF WLVEQYARSG RWADARNLMD VLAGLANELG LLSEEYSMKE KRMAGNFPQA FSHLTLVRAA DAMHGVDRLS LHPKH
|
| |
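The relationship between the two sequences above can be spot-checked by translating the gene sequence with translation table 11. The following is a minimal sketch, not the database's own method: the `translate` helper and `CODON_TABLE` are hypothetical names, and the codon assignments are the standard ones, which table 11 shares (it differs only in its set of permitted start codons). Although the record lists the strand as "-", the displayed gene sequence begins with ATG and ends with TGA, so the sketch assumes it is already given in coding orientation and applies no reverse complement.

```python
from itertools import product

# Compact codon table: codons enumerated in TCAG order, paired with amino acids.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Alternative start codons are not handled; this record starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases of the gene sequence, copied from the record.
first_residues = translate("ATGGCCGCAT TAATAGAGGA TTATGCGCTG")
last_residues = translate("CTGCATCCCA AACATTGA")
print(first_residues, last_residues)
```

The two fragments translate to the first ten and last five residues of the listed protein sequence (MAALIEDYAL and LHPKH). Passing the full gene sequence to the same helper would allow a complete comparison.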