Gene Arth_1828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1828 
Symbol 
ID4445657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2047213 
End bp2049030 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content61% 
IMG OID639689646 
Productglycoside hydrolase 15-related protein 
Protein accessionYP_831318 
Protein GI116670385 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0799869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGCAT TAATAGAGGA TTATGCGCTG CTCTCCGATC TTCACACCGG GCCCCTTGTT 
TCGCGGCGGG GCAGCGTTGA CTGGCTGTGT TTCCCGCGGT TCGACTCGCC CTCGGTCTTT
GCTGCCCTAC TCGGGGGTGA GGAACATGGC CGATGGCTGT TGGCGCCGAG CGCGCCGGAG
GCCGTGGTCA TTGACCGCCA CTATGTCGAC TCGACGTTTG TACTCCAAAC AACATGGCAG
ACCGATGCCG GCGAGGTCTT AGTGACGGAC TTCATGCCGG TGGGTGACAG CCGTTCGCGT
CTTGTCCGTC GCATGACAGG ACTCAGCGGG ACGGTCCTCA TGCGCCACGA GATTAGAATA
CGACCCCAAT ACGCCACTGT GCTGCCTTGG GTGAGTCGGG TCCGTGATAG CGCGCCGGGT
CAGGCTGCGG AGATTCTCCT GGCCATGTCG GGTCCGGACG CCCTGGCCCT CCGGGGAGAA
GATCTTCCGA TTGCGGAAGG TCACCGACAT GCCGGGGAAT TTTTGGTTGC ACAGGGGAAG
AACGTGGACT TCGAACTGAC ATGGTTCCCC TCGCACCAGG ATGTGCCATC GGCCGTCAAT
GTCGATGCCG CCCTTGACCT GGCTGTGGCC TATTGGACCA CCTGGGCAGG AAATTGCCGC
GATGACGGCA AATACGGCAG TGCAGTGAAG CGTTCGCTGC TGGTGCTTCG TGCGCTCACC
CACTACGAGA CAGGCGGCAT TGTCGCTGCG CCCACCACAT CCCTACCGGA GGATTTCGGT
GGGTCACGCA ATTGGGACTA CCGCTACTGT TGGCTGCGCG ATGCCTCTTT GACGTTGGAA
GCCATGTTGA CACATGGCTA TGAATCCGAG GCGCTGAAGT GGCGCAACTG GCTTTTACGT
GCACTCGCGG GCGATCCGGA GGACCTGCAG ATTATGTACG GCGTGGGGGG TGAGCGGGAC
CTGACGGAAA AGGAACTCCC CCACCTTCCC GGATACCAGA ATTCCAGACC GGTGCGAATC
GGTAACGCTG CTGTGTCCCA GTACCAGGCT GACGTGGTCG GTGAAGTGAT GGTGGCACTT
GAAAGGCTTC GGCTGGCCGG AGGCAAGGAA GACCACTTCT CCTGGGCACT CCAGCGTGCG
CTGCTCGGAT CCGTGGAAAA TCACCTCGAG GACAAAGACT TCGGCCTGTG GGAAATGCGC
GGCGATGCCC AGTACTTCAC CCACTCCCGG GTGATGATGT GGGCCGCTTT TGACAGCGGA
GTGCGGGCTG TTCGCGATCA CGGCCTACCT GGACCGGCAG AGCATTGGGA GCAACTCCGT
GAGGGATTGG CCACGGAAAT CATGGATCTC GGATTCAACC GGGACCTCAA CTCCTTCACC
CAAACCTATG GCGGTCGGCA GACGGACGCT GCTCTGCTGG CCTTGCCGCA GGTTGGCTTC
CTTGCCTACG ACGATGAGCG CATGCTCGGT ACGGTTGACC AGCTGGAAAA GGAGCTGCTC
ACTACCGAGG GCCTGCTGAT GCGCTACCGA ACCGAAACAG GAGTTGACGG ATTAGAACCG
GGAGAACATG CGTTCCTCGC ATGCTCCTTC TGGCTAGTGG AACAGTACGC GCGGTCAGGC
CGCTGGGCCG ACGCCAGGAA CCTGATGGAC GTGCTGGCCG GGCTCGCCAA CGAGCTGGGC
CTGCTCAGCG AAGAATATTC AATGAAGGAA AAGCGGATGG CGGGAAACTT TCCACAAGCT
TTCTCCCATC TGACCCTAGT GCGGGCGGCA GACGCCATGC ACGGCGTGGA CCGGCTCAGC
CTGCATCCCA AACATTGA
 
Protein sequence
MAALIEDYAL LSDLHTGPLV SRRGSVDWLC FPRFDSPSVF AALLGGEEHG RWLLAPSAPE 
AVVIDRHYVD STFVLQTTWQ TDAGEVLVTD FMPVGDSRSR LVRRMTGLSG TVLMRHEIRI
RPQYATVLPW VSRVRDSAPG QAAEILLAMS GPDALALRGE DLPIAEGHRH AGEFLVAQGK
NVDFELTWFP SHQDVPSAVN VDAALDLAVA YWTTWAGNCR DDGKYGSAVK RSLLVLRALT
HYETGGIVAA PTTSLPEDFG GSRNWDYRYC WLRDASLTLE AMLTHGYESE ALKWRNWLLR
ALAGDPEDLQ IMYGVGGERD LTEKELPHLP GYQNSRPVRI GNAAVSQYQA DVVGEVMVAL
ERLRLAGGKE DHFSWALQRA LLGSVENHLE DKDFGLWEMR GDAQYFTHSR VMMWAAFDSG
VRAVRDHGLP GPAEHWEQLR EGLATEIMDL GFNRDLNSFT QTYGGRQTDA ALLALPQVGF
LAYDDERMLG TVDQLEKELL TTEGLLMRYR TETGVDGLEP GEHAFLACSF WLVEQYARSG
RWADARNLMD VLAGLANELG LLSEEYSMKE KRMAGNFPQA FSHLTLVRAA DAMHGVDRLS
LHPKH