Gene Anae109_0208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnae109_0208 
Symbol 
ID5377075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. Fw109-5 
KingdomBacteria 
Replicon accessionNC_009675 
Strand
Start bp240734 
End bp241909 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content78% 
IMG OID640841720 
Product3-dehydroquinate synthase 
Protein accessionYP_001377410 
Protein GI153003085 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0205078 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGACG TCGTCGAGGT GATCCCGGCC CGGCAGGGCG AGCACGAGTA CCCGGTGCGC 
GTGGGCGCCG GCGCGGCCGA GCTCGTGGGC GCGCTCGCCG ACGAGCACGA TCGCGTGATG
CTCGTCTCCT GCGCGCGCGT GCTCCGCACG CCGTTCGGGA GGGCGGTGCG CGCCCGGCTG
GAGCGCGAGG CGCCGCTCGC CCTCGTCCAC GTGCTGCCGG ACGGGGAGCG CGGCAAGACC
CTGCGCGAGC TGGAGCGCGC CGCGGTGGCG ATGCTGCGCG CGGGCGCGAC GCGCGGGAGC
CTCGTCGTCG CGCTGGGCGG CGGCGCGGTG ACCGACGCCG CCGGCTTCCT CGCCGCGACC
TACATGCGCG GGGTGCGCTG GGCGGCGGTG CCCACCACGC TCCTCGGGAT GGTGGACGCC
GCCATCGGCG GCAAGACCGC GGTGAACCTG CCGGGCGCGA AGAACGCCGT GGGCGCCTTC
CACCCCCCGG AGGCGGTCCT CGCCGATCCC GCCGCGCTCG AGACGCTGCC CGCGCGCGAG
CTCCGGAGCG GCAAGGGAGA GGTGCTGAAG TACGCCGCGC TCCAGCCCGC CCTCCTCGGC
GCGTTCGGGC AGCAGCTCGC GGGCGACGAG GTGGACCCGC TCGTCATCGC CACCTGCGCC
CGCATGAAGG TGGACGTCGT CGAGGCGGAT CCCACGGAGC AGGGGCCGCG CAAGCTCCTC
AACCTCGGCC ACACCTTCGG GCACGGGGTG GAGGCCGCCG GCGACTTCTC CCGCTACGCG
CACGGCGAGG CGGTGGCCAT CGGGCTGGCG TTCGCGTTCC GGCTCGCGGC GCGGCTCGGC
CGGGTGGGGG CGGACGCGGT CGCCGCGGTC GAGGAGCGGA TCGCCCAGGC CGGGCTCCCC
GCGCGCGTGC CCCGGGCCGA CGCGCGCGCG GCGGCGAAGC TCATGGCGTT CGACAAGAAG
CGCGGCGAGG GCGGCCTGCG CTGGGTCCTC CCGCTCGCGA CGGAGAACGG GGGCTGGACG
GTGGAGTGGG ACGTCGCGGC GGAGCCGGCG GCGATCGAGG CGACGGTCGC CGAGCTGGAG
GCGCCCGCCC CTTCGCGCGC TTCGACTCCG GCGCGCGGGC GCGCTAGGCT CGCCCGCGGC
CGCGGCGCGG CCGGAAAGGC GAAGGCGACC AGATGA
 
Protein sequence
MTDVVEVIPA RQGEHEYPVR VGAGAAELVG ALADEHDRVM LVSCARVLRT PFGRAVRARL 
EREAPLALVH VLPDGERGKT LRELERAAVA MLRAGATRGS LVVALGGGAV TDAAGFLAAT
YMRGVRWAAV PTTLLGMVDA AIGGKTAVNL PGAKNAVGAF HPPEAVLADP AALETLPARE
LRSGKGEVLK YAALQPALLG AFGQQLAGDE VDPLVIATCA RMKVDVVEAD PTEQGPRKLL
NLGHTFGHGV EAAGDFSRYA HGEAVAIGLA FAFRLAARLG RVGADAVAAV EERIAQAGLP
ARVPRADARA AAKLMAFDKK RGEGGLRWVL PLATENGGWT VEWDVAAEPA AIEATVAELE
APAPSRASTP ARGRARLARG RGAAGKAKAT R