Gene A2cp1_1848 details


Gene Information

Locus tag: A2cp1_1848
Symbol:
ID: 7296374
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaeromyxobacter dehalogenans 2CP-1
Kingdom: Bacteria
Replicon accession: NC_011891
Strand:
Start bp: 2067537
End bp: 2068607
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 71%
IMG OID: 643594643
Product: phospho-2-dehydro-3-deoxyheptonate aldolase
Protein accession: YP_002492256
Protein GI: 220916952
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase
TIGRFAM ID: [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase
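
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2067537
end_bp = 2068607
gene_length_bp = 1071
protein_length_aa = 356


def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_aa_count(gene_length):
    """Residue count, assuming the gene length includes one stop codon."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("gene length is not a multiple of 3")
    return codons - 1  # the stop codon encodes no residue


print(span_length(start_bp, end_bp) == gene_length_bp)       # True
print(coding_aa_count(gene_length_bp) == protein_length_aa)  # True
```

Under those assumptions, 2068607 - 2067537 + 1 = 1071 bp, and 1071 / 3 = 357 codons, one of which is the stop codon, leaving 356 residues.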


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCAGAAGA CCGAGGACCT GAACATCGCC GCGTTCGACC TCATGCCGTC GCCGGACGAG 
GTCAAGGCGC GCATCCCCAT CACCGAGGAG GCCGTCCGCA CCGTGGTGGA GGGCCGCCGC
GCCATCGAGG CCATCCTCGA CGGCCGCGAC CCGCGCATGT TCGGCGTCAT CGGCCCCTGC
TCCATCCACG ACGCCGCCGC CGGGCTCGAC TACGCGCGCC GCCTGCGCCT CCTCGCCGAG
GAGGTGAAGG ACACGCTGGT GCTGGTGATG CGCGTGTACT TCGAGAAGCC GCGCACGTCC
GTGGGCTGGA AGGGCTTCAT CAACGATCCG TACATGGACG ACTCGTTCCG GGTGGACGAG
GGCATGGAGC GGGCGCGCCG CTTCCTGCTC CAGGTGAACG AGCTCGGCCT GCCCGCCGGC
ACCGAGGCGC TCGACCCGCA CGCGCCGCAG TACTACGGCG ACCTCGTCTC CTGGACCGCC
ATCGGCGCCC GGACCTCCGA GTCGCAGACG CACCGCGAGA TGTCGTCGGG CCTCTCCACC
CCGGTCGGCT TCAAGAACGG CACCGACGGC GACGTGGACG CGGCGGTGAA CGCCATCCTG
TCGGCGGGGC GGCCGCACAG CTTCCTGGGC GTGAACGGCC AGGGCCGGTC GGCCATCATC
CGCACCCGCG GCAACCGCTA CGGCCACCTG GTGCTGCGCG GCGGCGGCGG CCGGCCCAAC
TTCGACACCG TCTCCATCTC GCTCGCCGAG CAGGCGCTCA CGCGCGCCAA GCTCCCGCTC
AACCTGGTGG TGGACTGCTC CCACGCGAAC TCGTGGAAGA AGCCCGACCT GCAGCCGCTC
GTGCTCCGCG ACGTGGTGCA CCAGGTGCGC GAGGGCAACC GCTCGGTGGT CGGCTTCATG
GTCGAGAGCT TCATCGAGGC GGGCAGCCAG CCCATCCCCG AAGACCTCTC GAAGCTCCGC
TACGGCTGCT CGGTCACCGA CGCCTGCGTG GGCTGGGACA CCACCGTGGA GATGGTGCGC
CAGGCCCGCG AGGTGCTGAA GGACGTGCTG CCGCGGCGCG ACCGCGGGTA G
 
Protein sequence
MQKTEDLNIA AFDLMPSPDE VKARIPITEE AVRTVVEGRR AIEAILDGRD PRMFGVIGPC 
SIHDAAAGLD YARRLRLLAE EVKDTLVLVM RVYFEKPRTS VGWKGFINDP YMDDSFRVDE
GMERARRFLL QVNELGLPAG TEALDPHAPQ YYGDLVSWTA IGARTSESQT HREMSSGLST
PVGFKNGTDG DVDAAVNAIL SAGRPHSFLG VNGQGRSAII RTRGNRYGHL VLRGGGGRPN
FDTVSISLAE QALTRAKLPL NLVVDCSHAN SWKKPDLQPL VLRDVVHQVR EGNRSVVGFM
VESFIEAGSQ PIPEDLSKLR YGCSVTDACV GWDTTVEMVR QAREVLKDVL PRRDRG
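
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG is one of the permitted initiation codons and is read as methionine at the start of a coding sequence. The following is a minimal sketch of that translation, not the method used to produce this record. The helper name translate_cds is hypothetical, and the codon table is built from the compact form of NCBI genetic code 11 (bases ordered T, C, A, G).

```python
from itertools import product

# Compact encoding of NCBI genetic code 11; its amino acid assignments are the
# same as the standard code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCAGAAGA CCGAGGACCT GAACATCGCC"))  # MQKTEDLNIA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same function gives a translation that can be compared against the listed protein.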