Gene Arth_3767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3767 
Symbol 
ID4447852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4242957 
End bp4246034 
Gene Length3078 bp 
Protein Length1025 aa 
Translation table11 
GC content67% 
IMG OID639691591 
Productalpha-1,6-glucosidases, pullulanase-type 
Protein accessionYP_833242 
Protein GI116672309 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1523] Type II secretory pathway, pullulanase PulA and related glycosidases 
TIGRFAM ID[TIGR02103] alpha-1,6-glucosidases, pullulanase-type 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCGCC ACTCCCGCCT GCCCGGGTCC CCGCCCGTCC TCAACAGGAA ACTCCTGGCT 
GCGCTGGCCG CAGCCTGCGT CACTCCCCTC ACCCTGGGCG GCCTGGCCCT CCCCGCCTTT
GCGGACCACA CTGCCACTCC GGGGACGGTG GCCCTCGTCG GATCGCTGCA GTCCGAAATC
GGCTGCCCGG GCGACTGGCA GCCGGAGTGC CCCGCCAGCA GGCTGCTGCC GGTCGAGGGA
TCCGCAACTC TGTTCCGGGC CACCTTCGAT GTCCCGGCCG GCACCTACGC CATGAAAGTC
GCCCTCAACA ATTCATGGGA CGAGAACTAC GGCGCAGGCG GGGCGGCCGG CGGGGGCGAC
ATCGTGCTCA ACGCGCCGGG AGGACCGGTG ACCTTCACGT ACGACCACGC CAGCCACAGC
CTCACGGACG ACGTGCCGGA CGCGCTGGGA GCCGACGCCG GGGCGCACTG GCTGACGGCG
GACACCATCG CTTGGAAGGC CCCCGCAGCC GAGGGCACGA CCTACCGGCT CTACAGCGCC
CCCGACGGCG GCCTGGCGGT TGCCGACGGA CAGGTCACCG GCGGTTCCGC CCTTCCCCTC
GAGCTCGATG CCCAAGGGTT GGATGCAGGC CTTGCGGCAA AGTATCCGCA CCTGGCGGGC
CTCGCCTCGC TGCGGCTGAC GGAAGCCGGC TCACGCCAGG CCAAGGAACT GCTCAAGGGC
CAGCTGCTGG TGGCCGCGAT CGGCCCGGAC GGCAAAGTCA CAGCCACTAC CGGGATCCAG
GTTCCCGGCG TCCTGGACTC GCTGTACCCC GGAGCGGCAG AGCGGGAACT TGGACTGGCC
TGGAAGGGCA GGCGCCCGGA GCTTTCGCTG TGGGCCCCCA CGGCCCGCAG CGTCGCTGTC
CGCACCTACG CCTCCGGCTC CGGCGGCGAA CCCGTCGCCA CCACGGCCAT GAAACCGGGC
AAGGACGGCG TATGGTCCAT TACCGGGGAC AAGGACTGGA ACGGTGGCTA CTACCTGTAC
GAAGTGGAGG TGTTCGTTCC GGAAACCGGC AAGGTCGAGC GGAACCTAGT CACCGACCCC
TACAGCGTTG GCCTCTCCGC CAACTCCGAG CGCAGCCTCT TCGTGGATCT GGATGACAAG
TCCCTGGCAC CGTCCGGCTG GGCCAAACTC CAGAAGCCCG CGCTCAGCAA GCCGGAGGAC
CTTTCCGTCT ACGAGCTGCA CGTGCGTGAC TTTTCCATCA CCGACGACTC CGTTCCGGCC
GGGCACCGCG GCACCTACAA GGCGTTTACG GACACCGGCA GCAACGGCAT GAAGCGCATG
CAGGAACTGG TGGACGCCGG GATGAACGCG GTCCACCTGC TGCCTGTCAA CGACATCGGC
ACCATCGAGG AGCATCGCAG CGAGCAGCGC GAGCCGCTCT GTGACCTCGC AGCCCTTCCC
GCCGACTCAG AGGAACAACA GGCCTGCGTG GCAGGGACGG CAGCCAAGGA CGGTTTCAAC
TGGGGCTACG ATCCCCTGCA CTACACAACA CCCGAAGGCT CCTACTCCAC CAACCCGGAC
GGAGCCACCA GGATCACGGA GTTCCGTGAA ATGGTGGCCG CCCTCAACAC CACCGGCGCG
CGGGTCATCC AGGACGTCGT CTACAACCAC ACCTCCAGCG CAGGCCAGTC CGGAAGCAAC
AACCTCGATC GGATTGTCCC GGGCTACTAC CACCGCCTGA ACGCCGTCAC CGGTTCCCTG
GAGACGTCCA CGTGCTGCGC CAACACGGCC ACCGAGAACG CCATGATGGG CAAGCTCATG
GTCGACTCGC TGGTGACGCT GGCCAGGACC TACAAACTGG ACGGGTTCCG TTTCGACCTC
ATGGGGCACC ACTCAAAGCA GAACATGCTG GACGTCAGGG CTGCCCTAGA CAAGCTGACC
CTGCACAGGG ACGGCGTGGA CGGCAAGAAC ATCGCCCTGT ACGGCGAGGG CTGGAACTTC
GGCGAGGTGG CCAACAATGC CAGGTTCGTG CAGGCCACCC AGGCGAACAT GGCCGGAACC
GGCATCGGGA CGTTCAACGA CCGCCTGCGC GACGCCGTCC GCGGCGGGGG CCCGTTCGAC
CCCGATCCGC GCGTCCAGGG TTTCGCTTCC GGATTGTTCA CGGATCCCAA CGATTCCCCG
GCCAACGGCA CACCCGAACA GCAGAAGGCC GCCCTCCTTC TCGCACAGGA CCTGGTGAAG
GTGGGCCTGA CCGGCAACCT GAAGGACTAC TCCTTCATTG ACCGCACCGG CGCCGCCGTC
AAGGGTTCGG ACGTACCGTA CAACGGCGCC CCGGCGGGGT ACACCAGCGA TCCCCAGGAG
GCCATCACCT ACGTGGAGGC CCACGACAAC GAGACGTTGT TCGACGCCCT GGCCCTGAAA
CTGCCACAGG ATACGCCCAT GGCCGACCGC ACCCGGATGC AGACCCTTGC GCTCAGCACC
ACGGCCTTCG GCCAGGGCGT CTCCTTCTGG CATGCCGGAG GCGAATCGCT GCGCAGCAAG
TCCCTGGACC GCAACAGCTA CGATTCCGGC GACTGGTTCA ACGTCCTGGA CCACACGGAC
GCCACCAATG GATTCGGCCG CGGCCTGCCG CCCAAGGCTG ACAACGAGGA CAAGTACGGC
TACATGCGCC CGTTGCTGGC AGATCCGGCC CAGAAGCCGG CTTCCGCGGA CATCACGCAA
GCCCGTCAAC GGGCCGAGGA ACTGCTGCGC ATCCGCAAGA GCACGCCGTT GTTCCACCTG
GGCGATGCCG GACTGGTCCA GCAGAAGGTC TCCTTCCCGG CGGGCGGCCC GGAACAAACC
CCGGGCGTTG TGGTGATGCG CATTGACGAT TCGGTCGGGA CGGACGTGGA CCCGAACCTG
CGCGGGCTGG TGGTGGTGTT CAATGCCTCG GATGAGGCCA CGAGCCAAAC CCTGTCCGGC
ACGGCGCGTT CCGCCTACGC CCTGCACCCG GTGCAGGCCG GGGGCGTCGA TTCCGTGGTC
AAGGCGGCCG GATATGATGC GCAGTCGGGG ACCTTCAGCG TCCCCGCCCG CACGGTGGCG
GTGTTCCAGG CCAGGTAA
 
Protein sequence
MIRHSRLPGS PPVLNRKLLA ALAAACVTPL TLGGLALPAF ADHTATPGTV ALVGSLQSEI 
GCPGDWQPEC PASRLLPVEG SATLFRATFD VPAGTYAMKV ALNNSWDENY GAGGAAGGGD
IVLNAPGGPV TFTYDHASHS LTDDVPDALG ADAGAHWLTA DTIAWKAPAA EGTTYRLYSA
PDGGLAVADG QVTGGSALPL ELDAQGLDAG LAAKYPHLAG LASLRLTEAG SRQAKELLKG
QLLVAAIGPD GKVTATTGIQ VPGVLDSLYP GAAERELGLA WKGRRPELSL WAPTARSVAV
RTYASGSGGE PVATTAMKPG KDGVWSITGD KDWNGGYYLY EVEVFVPETG KVERNLVTDP
YSVGLSANSE RSLFVDLDDK SLAPSGWAKL QKPALSKPED LSVYELHVRD FSITDDSVPA
GHRGTYKAFT DTGSNGMKRM QELVDAGMNA VHLLPVNDIG TIEEHRSEQR EPLCDLAALP
ADSEEQQACV AGTAAKDGFN WGYDPLHYTT PEGSYSTNPD GATRITEFRE MVAALNTTGA
RVIQDVVYNH TSSAGQSGSN NLDRIVPGYY HRLNAVTGSL ETSTCCANTA TENAMMGKLM
VDSLVTLART YKLDGFRFDL MGHHSKQNML DVRAALDKLT LHRDGVDGKN IALYGEGWNF
GEVANNARFV QATQANMAGT GIGTFNDRLR DAVRGGGPFD PDPRVQGFAS GLFTDPNDSP
ANGTPEQQKA ALLLAQDLVK VGLTGNLKDY SFIDRTGAAV KGSDVPYNGA PAGYTSDPQE
AITYVEAHDN ETLFDALALK LPQDTPMADR TRMQTLALST TAFGQGVSFW HAGGESLRSK
SLDRNSYDSG DWFNVLDHTD ATNGFGRGLP PKADNEDKYG YMRPLLADPA QKPASADITQ
ARQRAEELLR IRKSTPLFHL GDAGLVQQKV SFPAGGPEQT PGVVVMRIDD SVGTDVDPNL
RGLVVVFNAS DEATSQTLSG TARSAYALHP VQAGGVDSVV KAAGYDAQSG TFSVPARTVA
VFQAR