Gene Arth_2898 details


Gene Information

Locus tag: Arth_2898
Symbol:
ID: 4444420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 3263830
End bp: 3265230
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 71%
IMG OID: 639690721
Product: glycoside hydrolase family protein
Protein accession: YP_832377
Protein GI: 116671444
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1486] Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases
TIGRFAM ID:
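
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic; it assumes the coordinates are 1-based and inclusive (which is what the reported 1401 bp implies) and that the final codon is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3263830
end_bp = 3265230

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. The last codon is the stop
# codon, which does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1401 0 466
```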


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGCTCA TGATTGCCGG TGGCGGCGGA TTCCGTGTCC CGCTCGTGTA CCGGGCGCTC 
ACCTCCGGCC CCTTCGCGGG GCTGGTGCGC GAGCTTGTGC TCTACGACGT CGATCCCCTG
CGGCTGGCCG CCATCAAGGC CGTCCTGGCG TCGATGGCCC CTTCCAACGG TGCACGCGGC
CCTGCAGTCA GCACCACCAC CTCGCTGGCC GAGGGGCTCG AAGGCACCGC CATGGTGTTC
GCGGCCATCC GGCCCGGAGG AACAGCTGGC CGCACCGCTG ACGAACGCGT GGCAATCGGC
CTGGGCCTGC TGGGCCAGGA GACCACGGGC GCCGGCGGAA TCTCCTACGC GCTGCGATCC
ATCCCCGGGA TGCTGGCCTT GGCCCGAGAA ATGAAGCAGC GATGCCCCGG GGCATGGTTG
GTTAACTTCA CCAACCCGGC CGGGATGGTC ACCGAGGCGC TGGTGCCCGT GCTGGGAAAC
CGGGTGATCG GCATCTGCGA CTCGGCCGGC GGCCTGGTCC ACCGGGCGGC CCGGGCGGCC
GGCGCCGCCT TGCCGGAGGG AAGGCTCGAC GGCGTGGGCT ACTACGGGCT CAACCACCTG
GGCTGGCTGT ACCGGCTGGA ATCCGGCGGA AAGGACCTGC TGCCCGGACT GCTGGCGGAC
GCGCAGGCCC TGTCCGGCTT CGAGGAGGGC CGGCTCTTTC CGCAGCCGTT CCTCCAGGCA
CTCGGCTGCC TGCCCAACGA GTACCTCTAT TACTACTACG ACACCGCCCG GGCCGTCGGC
GCCATCCGCG CCATGCGGCA GACACGCGGC GAGTCGATCC ACGAGCAGCA GTCGGGGCTC
TACCCGGCGC TGGCCGCGGC CGGATCCCGC GCCTACGAAC TGTGGGACGC GGCCCGCCGT
TCGCGCGAGG AAGGCTACCT CGCCGAGGCA CGCGCCGGCG GCGAACAACG GGACGAGGAG
GACCTCGCGG GCGGCGGGTA TGAGCGCGTT GCCCTCGCGG TTATGCGCGC CCTGGCCGGC
GGTGCGCCGG ACGATGCAGC GCCCCTCAGC GGTGGCATTG CTGCCGGGAT CACGGAGCTC
ATCCTGAACA CCCCCAACGA AGGTGCGGTT CCCGGGCTGC CGGCGGACGC TGTCGTCGAG
GTACCCTGCC AGGTGACGCC CGACGGCGCC ATGCCGCTGC CGCAGGACCG GCCCGGAGAC
GCGCAGCTTG CCCTCATGCA GCGGGTCAAG GAGGTGGAGC GGCTCACGGT GTCGTCCGTG
GTGCAGGGAC GACGGTCCGA CGCTCTGCGG GCTTTCGGGC TCCATCCGCT GATCGACTCC
GAGGAACTGG CTGTGAAACT CCTGGCCGGC TACGAGGCGG CGTTTCCGGC GCTCGGGCAG
CTGTGGCGCG GCGGGGCCTG A
 
Protein sequence
MRLMIAGGGG FRVPLVYRAL TSGPFAGLVR ELVLYDVDPL RLAAIKAVLA SMAPSNGARG 
PAVSTTTSLA EGLEGTAMVF AAIRPGGTAG RTADERVAIG LGLLGQETTG AGGISYALRS
IPGMLALARE MKQRCPGAWL VNFTNPAGMV TEALVPVLGN RVIGICDSAG GLVHRAARAA
GAALPEGRLD GVGYYGLNHL GWLYRLESGG KDLLPGLLAD AQALSGFEEG RLFPQPFLQA
LGCLPNEYLY YYYDTARAVG AIRAMRQTRG ESIHEQQSGL YPALAAAGSR AYELWDAARR
SREEGYLAEA RAGGEQRDEE DLAGGGYERV ALAVMRALAG GAPDDAAPLS GGIAAGITEL
ILNTPNEGAV PGLPADAVVE VPCQVTPDGA MPLPQDRPGD AQLALMQRVK EVERLTVSSV
VQGRRSDALR AFGLHPLIDS EELAVKLLAG YEAAFPALGQ LWRGGA
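
The sequences above are laid out in blocks of ten characters, so they need to be cleaned before any computation. The Python sketch below is one way to strip the layout whitespace, compute GC content, and translate a nucleotide sequence. It is a minimal illustration, not the method used to produce this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard set of codon assignments, which translation table 11 shares. Alternative start codons allowed by table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: translation table 11 uses these same codon-to-amino-acid
# assignments and differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and the final row of the gene sequence above.
first_blocks = "ATGCGGCTCA TGATTGCCGG TGGCGGCGGA"
last_row = "CTGTGGCGCG GCGGGGCCTG A"

print(translate(first_blocks))    # MRLMIAGGGG
print(translate(last_row))        # LWRGGA
print(gc_content("ATGCGGCTCA"))   # 60.0
```

The two translated fragments match the first block and the final six residues of the protein sequence shown above. The last row can be translated on its own because it begins at position 1381, which is in frame with the start codon.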