Gene Dole_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1947 
Symbol 
ID5694787 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2356890 
End bp2357954 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content63% 
IMG OID641264545 
Productchorismate synthase 
Protein accessionYP_001529828 
Protein GI158521958 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGGTA ATACTTTTGG AGAGCTGTTT CGCGTTACCA CATGGGGAGA GTCCCACGGG 
CCGGGCATCG GTGTGGTCAT CGACGGGTGC CCGCCCGGCC TTGCCCTGGA CGAAGCCGGG
GTGCAGAAAA TGCTGGACCG CCGCAAGCCC GGCGGCGGGT CCATTGCCAG CACCGCCAGA
AAAGAGGCGG ACCGGGCCGT TATCCTGTCC GGCGTGTTTG AAGGCAAAAC CACGGGCACC
CCGATCCTGA TCATGGCCCA TAACAGGGAT GCCCGGTCAT CCGCCTACAC CGACATCGCC
GGCCTGTTCC GGCCCGGGCA TGGTGACATC ACCTACACGG CCAAGTACGG CATTCGGGAC
TGGCGGGGCG GGGGCCGGGC CTCGGCCCGG GAGACCTTTG GCCGGGTGGC GGCCGGGGCC
GTGGCCGCTG AACTGCTTCG GCTTTCCGGT ATTTCAGTTG CGGCCTACAC CCTGGAACTG
GGCGGCATCC GCGCAACAAC CATTGATGTC GGGCAGGTTG ATCAGAACAT GTTCGGCTGC
CCGGACAGCA CTGTTATGGC GGCCATGACT GACCGTGTGA CCCAGGTAAA GCGGCGGGGT
GACTCTGTCG GCGGCATCGT CGAGGTCCGT GCCGATGGCG TGCCCGCCGG CCTGGGAGAG
CCGGTGTTTG ACAAACTGGA TGCCGACATT GCCAAAGCCC TGATGAGTAT CGGCGCGGTA
AAGGGAGTTG AGATCGGCGC CGGGTTTGAA GCATCGGGTA TGACCGGCTC CCGGAGCAAC
GATGAAATCA CGCCCCAGGG GTTTGCCACC AATAATGCCG GCGGCATTCT GGCCGGCATT
TCCAACGGGG ACCGGATCGT GGCCAGGGCC GCGGTCAAGC CGATTCCCTC CATCGGCATT
ACCCAGCAAA CCGTGGATAC AAACGGCAAA CCGGCCTCCA TTTCCATCAA GGGCCGGCAC
GATATTTCCG CCATTCCCCG GATCAACGTG GTGTGTGAGG CCATGGTGTG CCTGGTGCTG
GCCGATCATC TTCTTAGACA GAAAGCGATT TCATGGACCC GGTAA
 
Protein sequence
MAGNTFGELF RVTTWGESHG PGIGVVIDGC PPGLALDEAG VQKMLDRRKP GGGSIASTAR 
KEADRAVILS GVFEGKTTGT PILIMAHNRD ARSSAYTDIA GLFRPGHGDI TYTAKYGIRD
WRGGGRASAR ETFGRVAAGA VAAELLRLSG ISVAAYTLEL GGIRATTIDV GQVDQNMFGC
PDSTVMAAMT DRVTQVKRRG DSVGGIVEVR ADGVPAGLGE PVFDKLDADI AKALMSIGAV
KGVEIGAGFE ASGMTGSRSN DEITPQGFAT NNAGGILAGI SNGDRIVARA AVKPIPSIGI
TQQTVDTNGK PASISIKGRH DISAIPRINV VCEAMVCLVL ADHLLRQKAI SWTR