Gene A2cp1_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA2cp1_4420 
Symbol 
ID7300476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter dehalogenans 2CP-1 
KingdomBacteria 
Replicon accessionNC_011891 
Strand
Start bp4917898 
End bp4918995 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content77% 
IMG OID643597226 
Productglycoside hydrolase family 3 domain protein 
Protein accessionYP_002494803 
Protein GI220919499 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.511372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCACCC CGACCTCCCT CGACGCCCAG GTCGCCGGCC TGTTCTGCGT CGGCTTCGAC 
GGCAAGACCG CCTCGCCCGA GGTGCTCGAG CTCATCCGCC GCGGCGTCCA CGGCGTGGTG
CTGTTCGCGC GCAACGTGGA GAGCGCCGAG CAGGTCGCGG CGCTCACCGC CGAGCTGAAG
CGCGCCGCAG GGCGGCCGCT GCTCGTGGCG GTGGACCAGG AGGGCGGGCG GGTGGCGCGG
CTGCGGGCGC GCCACGGCTT CACCGAGCTG CCGCCCATGC GCGCGGTGGG CGAGGTCGGC
GACGCCGGGC TGGCGCGCGA GGTGGGCGCG CTGCTCGGCC GCGAGCTGCG GGCGGTCGGC
ATCGACCAGG ACTACGCGCC GGTGGTGGAC GTGGACACCA ACCCCGCCAA CCCGGTCATC
GGCGACCGCA GCCTGTCCCG CGATCCGGAG ACGGTCGGCC GCCTCGGCGC CGCCATCGCG
CTCGGGCTCC AGTCGGCCGG GGTGGCCGCC TGCGCCAAGC ACTTCCCCGG GCACGGCGAC
ACGAGCCAGG ACTCGCACAC CGACCTGCCC CGCCTCCCGC ACGCGCTGGA GCGGCTCCGG
GCGGTGGAGC TCGCGCCGTT CCGCGCGCTG GCGCGCGCCG GGGTGGCCTC GGTCATGACC
GCCCACGTGG TGTTCGAGGC GCTCGACCGC GATCGCCCGG CCACGCTCTC GCCGGCGGTG
ATGCGCCTGC TGCGCGAGGA GGTCGGCTTC GACGGCTGCG CCATCTCCGA CGACCTGGAG
ATGCAGGCGG TGGCCGGGCA CTTCCCGCTC GAGGAGTCGG CGCCCGGGGC GGTGGCGGCC
GGGGTGGACG CGCTGCTCGT CTGCCACTCG CCCGCGGTAC AGCACCGGGC CATCGACCTG
GTGCGCGCCG CGGTGGAGAC GGGTCGCATC CCCGGGGACC GGGTCGCCGA GGCCCGGGGC
CGGGTGGGTC GGCTGCTCGC GTACGCGGGT CCGCCGCCGG ACCCGGCCCG GGTGCGGGAG
CGGCTGCGCA CGCCGGAGCA CCTGGCGCTG GTCGAGAACG TCCCGGCGCT CGAGGTCGGC
CGCGACCCGA CCGTCTGA
 
Protein sequence
MPTPTSLDAQ VAGLFCVGFD GKTASPEVLE LIRRGVHGVV LFARNVESAE QVAALTAELK 
RAAGRPLLVA VDQEGGRVAR LRARHGFTEL PPMRAVGEVG DAGLAREVGA LLGRELRAVG
IDQDYAPVVD VDTNPANPVI GDRSLSRDPE TVGRLGAAIA LGLQSAGVAA CAKHFPGHGD
TSQDSHTDLP RLPHALERLR AVELAPFRAL ARAGVASVMT AHVVFEALDR DRPATLSPAV
MRLLREEVGF DGCAISDDLE MQAVAGHFPL EESAPGAVAA GVDALLVCHS PAVQHRAIDL
VRAAVETGRI PGDRVAEARG RVGRLLAYAG PPPDPARVRE RLRTPEHLAL VENVPALEVG
RDPTV