Gene A2cp1_4289 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA2cp1_4289 
Symbol 
ID7298364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter dehalogenans 2CP-1 
KingdomBacteria 
Replicon accessionNC_011891 
Strand
Start bp4780022 
End bp4781224 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content70% 
IMG OID643597095 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002494672 
Protein GI220919368 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.53848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGAACC GCTCGCTGCT CGCCCTGCTC GCCGCCCTGC TGCTCCCCGG CCTCGCCTCC 
GCGGCGGAGG TGGTGGTGTG GCACGCCTAC CGCGGCGCCG AGAAGGCCGC CTTCGAGAAG
GTCGCCGCCG CGTACAACGC GCGCCCCGGC AACCCGAACA AGGTGACCAC GCTCGCCGTC
CCTTACGACG CGTTCGCCGA CAAGATCTCC GCGGCGGTGC CGCGCGGCAA GGGGCCGGAC
GTCTTCATCT TCGCGCAGGA CCGCCTGGGC GGCTGGATCG AGGCGGGCAA CACCGTCGAG
TCGATCGACT TCTTCATGGA CGACGCGCTG AAGGCGCGCT TCATCCCCTC CACGCTCGAG
GCGATGACCT ACCGCGGCGG CGTGTGGGGG CTCCCGCTCA ACTACAAGTG CATCGCGCTC
GTCTACAACA AGAAGCTGGT GAAGGCGCCG CCGAGGACCA GCGCGGAGCT GGAGGCCGTC
GCGAAGAAGC TCACCCAGCG CGCCGCGGGC CGGTTCGGCC TCGCGTACTC CTACTCCGAC
TACTACTACC ACGCGGCCCT CCAGAACACG TTCGGCGGCC GGGTGTTCGA CGCCGGCAAG
CCCGTGCTCG ACGCGCCCGA GAACGTGAAG GCGGCGGAGC TGCTGCAGGC CTGGATCAAG
GCGGGCTTCA TGCCGGCGGA GCCGTCCACC GCGCTCATCA CCTCGCTGTT CAACGAGGGC
AAGGCGGCGA TGGTGTTCTC GGGCAACTGG TTCCTGGGCG AGATCGCCCC GGGCATCGAC
TGGGCCGTCG CCACGCTGCC GGCGCTCACC GAGGCGGGCG GGAAGCCCAT GCGCCCGTGG
ACCACGGTGG AGGGCGTCTA CGTGGCCGCG CCCTCGAAGC ACAAGGACGC CGCGTTCGAC
TTCGTGAAGT TCGCCACCGA CGTGGACGCC GCGCGGATCA TGGCGCTCGA GGGCCGGCAG
AGCCCCGCCA ACGCGAAGGT CTACGCCGAC GCGAAGGTGG CGGCCGACCC GGTGCTGGCC
GCGTTCAAGA AGCAGGTGGA CGTGGCGGTG CCCATGCCCA ACCTGCCCGA GATGACGATG
GTCTGGTCGC CCGCCACGAC CGCCATGGGC GCCATCACCC GCGGCGGCGA CGCGAAGGCC
GCGCTCGCCA AGGCGCAGGC CAAGGTCACC GAGGACGTCG CGAAGCTGCG CAAGAGCAAG
TGA
 
Protein sequence
MRNRSLLALL AALLLPGLAS AAEVVVWHAY RGAEKAAFEK VAAAYNARPG NPNKVTTLAV 
PYDAFADKIS AAVPRGKGPD VFIFAQDRLG GWIEAGNTVE SIDFFMDDAL KARFIPSTLE
AMTYRGGVWG LPLNYKCIAL VYNKKLVKAP PRTSAELEAV AKKLTQRAAG RFGLAYSYSD
YYYHAALQNT FGGRVFDAGK PVLDAPENVK AAELLQAWIK AGFMPAEPST ALITSLFNEG
KAAMVFSGNW FLGEIAPGID WAVATLPALT EAGGKPMRPW TTVEGVYVAA PSKHKDAAFD
FVKFATDVDA ARIMALEGRQ SPANAKVYAD AKVAADPVLA AFKKQVDVAV PMPNLPEMTM
VWSPATTAMG AITRGGDAKA ALAKAQAKVT EDVAKLRKSK