Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_3767 |
Symbol | |
ID | 4447852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 4242957 |
End bp | 4246034 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691591 |
Product | alpha-1,6-glucosidases, pullulanase-type |
Protein accession | YP_833242 |
Protein GI | 116672309 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases |
TIGRFAM ID | [TIGR02103] alpha-1,6-glucosidases, pullulanase-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCGCC ACTCCCGCCT GCCCGGGTCC CCGCCCGTCC TCAACAGGAA ACTCCTGGCT GCGCTGGCCG CAGCCTGCGT CACTCCCCTC ACCCTGGGCG GCCTGGCCCT CCCCGCCTTT GCGGACCACA CTGCCACTCC GGGGACGGTG GCCCTCGTCG GATCGCTGCA GTCCGAAATC GGCTGCCCGG GCGACTGGCA GCCGGAGTGC CCCGCCAGCA GGCTGCTGCC GGTCGAGGGA TCCGCAACTC TGTTCCGGGC CACCTTCGAT GTCCCGGCCG GCACCTACGC CATGAAAGTC GCCCTCAACA ATTCATGGGA CGAGAACTAC GGCGCAGGCG GGGCGGCCGG CGGGGGCGAC ATCGTGCTCA ACGCGCCGGG AGGACCGGTG ACCTTCACGT ACGACCACGC CAGCCACAGC CTCACGGACG ACGTGCCGGA CGCGCTGGGA GCCGACGCCG GGGCGCACTG GCTGACGGCG GACACCATCG CTTGGAAGGC CCCCGCAGCC GAGGGCACGA CCTACCGGCT CTACAGCGCC CCCGACGGCG GCCTGGCGGT TGCCGACGGA CAGGTCACCG GCGGTTCCGC CCTTCCCCTC GAGCTCGATG CCCAAGGGTT GGATGCAGGC CTTGCGGCAA AGTATCCGCA CCTGGCGGGC CTCGCCTCGC TGCGGCTGAC GGAAGCCGGC TCACGCCAGG CCAAGGAACT GCTCAAGGGC CAGCTGCTGG TGGCCGCGAT CGGCCCGGAC GGCAAAGTCA CAGCCACTAC CGGGATCCAG GTTCCCGGCG TCCTGGACTC GCTGTACCCC GGAGCGGCAG AGCGGGAACT TGGACTGGCC TGGAAGGGCA GGCGCCCGGA GCTTTCGCTG TGGGCCCCCA CGGCCCGCAG CGTCGCTGTC CGCACCTACG CCTCCGGCTC CGGCGGCGAA CCCGTCGCCA CCACGGCCAT GAAACCGGGC AAGGACGGCG TATGGTCCAT TACCGGGGAC AAGGACTGGA ACGGTGGCTA CTACCTGTAC GAAGTGGAGG TGTTCGTTCC GGAAACCGGC AAGGTCGAGC GGAACCTAGT CACCGACCCC TACAGCGTTG GCCTCTCCGC CAACTCCGAG CGCAGCCTCT TCGTGGATCT GGATGACAAG TCCCTGGCAC CGTCCGGCTG GGCCAAACTC CAGAAGCCCG CGCTCAGCAA GCCGGAGGAC CTTTCCGTCT ACGAGCTGCA CGTGCGTGAC TTTTCCATCA CCGACGACTC CGTTCCGGCC GGGCACCGCG GCACCTACAA GGCGTTTACG GACACCGGCA GCAACGGCAT GAAGCGCATG CAGGAACTGG TGGACGCCGG GATGAACGCG GTCCACCTGC TGCCTGTCAA CGACATCGGC ACCATCGAGG AGCATCGCAG CGAGCAGCGC GAGCCGCTCT GTGACCTCGC AGCCCTTCCC GCCGACTCAG AGGAACAACA GGCCTGCGTG GCAGGGACGG CAGCCAAGGA CGGTTTCAAC TGGGGCTACG ATCCCCTGCA CTACACAACA CCCGAAGGCT CCTACTCCAC CAACCCGGAC GGAGCCACCA GGATCACGGA GTTCCGTGAA ATGGTGGCCG CCCTCAACAC CACCGGCGCG CGGGTCATCC AGGACGTCGT CTACAACCAC ACCTCCAGCG CAGGCCAGTC CGGAAGCAAC AACCTCGATC GGATTGTCCC GGGCTACTAC CACCGCCTGA ACGCCGTCAC CGGTTCCCTG GAGACGTCCA CGTGCTGCGC CAACACGGCC ACCGAGAACG CCATGATGGG CAAGCTCATG GTCGACTCGC TGGTGACGCT GGCCAGGACC TACAAACTGG ACGGGTTCCG TTTCGACCTC ATGGGGCACC ACTCAAAGCA GAACATGCTG GACGTCAGGG CTGCCCTAGA CAAGCTGACC CTGCACAGGG ACGGCGTGGA CGGCAAGAAC ATCGCCCTGT ACGGCGAGGG CTGGAACTTC GGCGAGGTGG CCAACAATGC CAGGTTCGTG CAGGCCACCC AGGCGAACAT GGCCGGAACC GGCATCGGGA CGTTCAACGA CCGCCTGCGC GACGCCGTCC GCGGCGGGGG CCCGTTCGAC CCCGATCCGC GCGTCCAGGG TTTCGCTTCC GGATTGTTCA CGGATCCCAA CGATTCCCCG GCCAACGGCA CACCCGAACA GCAGAAGGCC GCCCTCCTTC TCGCACAGGA CCTGGTGAAG GTGGGCCTGA CCGGCAACCT GAAGGACTAC TCCTTCATTG ACCGCACCGG CGCCGCCGTC AAGGGTTCGG ACGTACCGTA CAACGGCGCC CCGGCGGGGT ACACCAGCGA TCCCCAGGAG GCCATCACCT ACGTGGAGGC CCACGACAAC GAGACGTTGT TCGACGCCCT GGCCCTGAAA CTGCCACAGG ATACGCCCAT GGCCGACCGC ACCCGGATGC AGACCCTTGC GCTCAGCACC ACGGCCTTCG GCCAGGGCGT CTCCTTCTGG CATGCCGGAG GCGAATCGCT GCGCAGCAAG TCCCTGGACC GCAACAGCTA CGATTCCGGC GACTGGTTCA ACGTCCTGGA CCACACGGAC GCCACCAATG GATTCGGCCG CGGCCTGCCG CCCAAGGCTG ACAACGAGGA CAAGTACGGC TACATGCGCC CGTTGCTGGC AGATCCGGCC CAGAAGCCGG CTTCCGCGGA CATCACGCAA GCCCGTCAAC GGGCCGAGGA ACTGCTGCGC ATCCGCAAGA GCACGCCGTT GTTCCACCTG GGCGATGCCG GACTGGTCCA GCAGAAGGTC TCCTTCCCGG CGGGCGGCCC GGAACAAACC CCGGGCGTTG TGGTGATGCG CATTGACGAT TCGGTCGGGA CGGACGTGGA CCCGAACCTG CGCGGGCTGG TGGTGGTGTT CAATGCCTCG GATGAGGCCA CGAGCCAAAC CCTGTCCGGC ACGGCGCGTT CCGCCTACGC CCTGCACCCG GTGCAGGCCG GGGGCGTCGA TTCCGTGGTC AAGGCGGCCG GATATGATGC GCAGTCGGGG ACCTTCAGCG TCCCCGCCCG CACGGTGGCG GTGTTCCAGG CCAGGTAA
|
Protein sequence | MIRHSRLPGS PPVLNRKLLA ALAAACVTPL TLGGLALPAF ADHTATPGTV ALVGSLQSEI GCPGDWQPEC PASRLLPVEG SATLFRATFD VPAGTYAMKV ALNNSWDENY GAGGAAGGGD IVLNAPGGPV TFTYDHASHS LTDDVPDALG ADAGAHWLTA DTIAWKAPAA EGTTYRLYSA PDGGLAVADG QVTGGSALPL ELDAQGLDAG LAAKYPHLAG LASLRLTEAG SRQAKELLKG QLLVAAIGPD GKVTATTGIQ VPGVLDSLYP GAAERELGLA WKGRRPELSL WAPTARSVAV RTYASGSGGE PVATTAMKPG KDGVWSITGD KDWNGGYYLY EVEVFVPETG KVERNLVTDP YSVGLSANSE RSLFVDLDDK SLAPSGWAKL QKPALSKPED LSVYELHVRD FSITDDSVPA GHRGTYKAFT DTGSNGMKRM QELVDAGMNA VHLLPVNDIG TIEEHRSEQR EPLCDLAALP ADSEEQQACV AGTAAKDGFN WGYDPLHYTT PEGSYSTNPD GATRITEFRE MVAALNTTGA RVIQDVVYNH TSSAGQSGSN NLDRIVPGYY HRLNAVTGSL ETSTCCANTA TENAMMGKLM VDSLVTLART YKLDGFRFDL MGHHSKQNML DVRAALDKLT LHRDGVDGKN IALYGEGWNF GEVANNARFV QATQANMAGT GIGTFNDRLR DAVRGGGPFD PDPRVQGFAS GLFTDPNDSP ANGTPEQQKA ALLLAQDLVK VGLTGNLKDY SFIDRTGAAV KGSDVPYNGA PAGYTSDPQE AITYVEAHDN ETLFDALALK LPQDTPMADR TRMQTLALST TAFGQGVSFW HAGGESLRSK SLDRNSYDSG DWFNVLDHTD ATNGFGRGLP PKADNEDKYG YMRPLLADPA QKPASADITQ ARQRAEELLR IRKSTPLFHL GDAGLVQQKV SFPAGGPEQT PGVVVMRIDD SVGTDVDPNL RGLVVVFNAS DEATSQTLSG TARSAYALHP VQAGGVDSVV KAAGYDAQSG TFSVPARTVA VFQAR
|
| |