Gene Dole_1949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1949 
Symbol 
ID5694789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2358751 
End bp2360022 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content62% 
IMG OID641264547 
Product3-phosphoshikimate 1-carboxyvinyltransferase 
Protein accessionYP_001529830 
Protein GI158521960 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0128] 5-enolpyruvylshikimate-3-phosphate synthase 
TIGRFAM ID[TIGR01356] 3-phosphoshikimate 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGAAA TACAACAGCA ACCGGTCCAG TCCTGTGAGG TAAGCGTTCC CGGTTCAAAG 
AGCTATACCC ACCGTGTCCT GATTGCCGCG GCCCTGTCCG ACGGCGTCTG CCGGCTGGGA
AACTGCCTGG AGAGTGAAGA CACCCACCTG ACCCGGGAGG CCCTGGTAAA GATGGGAGTT
CGCATTGAAA AAGCGGAAGA TCGCCTGGTG GTGCATGGTA CCGGCGGCCG CCTGCTGCCC
TGCGGTGATC CCATCTTCCT GGGCAACTCC GGCACCTCCA TGCGGCTGCT CACCGGCGTG
GCCGCCATCG GCCAGGGGAC ATACCTGCTG ATCGGAACGG ATCGCATGGC CCAGCGGCCC
GTGGCCGACC TGCTGGAAGG CCTGGACCAG ATCGGCGTGC CGGCCCGTTC GGTGAACAAC
AACGGGTGCC CGCCCCTGGA GATTGTCGCC GGAAAAGCCC AGGGCGGGCA TGTTCGCCTG
CGGTGCGGCA TAAGCAGCCA GTATCTCTCT TCCTTGCTTC TGGCGGCCCC CTATATCGAC
GGCGGCCTGA ATATCGAGGT GACGGAGGGG CCGGTCTCAA AACCGTATAT CGACATGACC
CTGGACATCA TGGACCGGTT CGGCGTGACA GTGGAGCGGG ACGGGTATAC CCGTTTCCGC
GTGGCCGGAG GACAGTGCTA CCGGAAAGGC GATTACGCGG TGGAGCCCGA CGCCTCCCAG
GCCAGTTATT TCTGGGCCGC GGCGGCCGTG ACCGGTGCCA CGGTCAAGGT GATGGGCATG
ACTCCTGAAT CCCGGCAGGG AGACGTTCGG TTTGTAGAAG TGCTGGAGGC AATGGGATGT
AAGGTTAACA GGGAGATTGA CGGCATTGCC GTGACCGGAG GCCCGCTTTC GGCCGTGGAT
GTGGACATGG GCGACATGCC TGACCTGGTG CCCACTCTGT CGGTGGTAGC GGCATTCACG
CAAGGCATCA CCGTCATTCG CAACGTGGCT CACCTCAAGG AAAAAGAGAG CGACCGGCTG
GCGGCGGTGG CCGCCGAACT TTCAAAAATG GGGATTACCG TTGTCCGTAC CGACACCGGC
CTTGAGATCA CCGGAGGACG GCCCCATGGC GCGGTCATTG AAACCTACAA CGATCATCGC
ATGGCCATGA GCTTTGCCGT TGCTGGCCTG GTGACCCCTG GGGTGACCAT CGCCAATGAG
GGGTGCGTGG CCAAATCCTT TCCCGGCTTC TGGCAGGTGT TTGAAGGTCT TTACAGTTCA
GGTATTTCAT GA
 
Protein sequence
MKEIQQQPVQ SCEVSVPGSK SYTHRVLIAA ALSDGVCRLG NCLESEDTHL TREALVKMGV 
RIEKAEDRLV VHGTGGRLLP CGDPIFLGNS GTSMRLLTGV AAIGQGTYLL IGTDRMAQRP
VADLLEGLDQ IGVPARSVNN NGCPPLEIVA GKAQGGHVRL RCGISSQYLS SLLLAAPYID
GGLNIEVTEG PVSKPYIDMT LDIMDRFGVT VERDGYTRFR VAGGQCYRKG DYAVEPDASQ
ASYFWAAAAV TGATVKVMGM TPESRQGDVR FVEVLEAMGC KVNREIDGIA VTGGPLSAVD
VDMGDMPDLV PTLSVVAAFT QGITVIRNVA HLKEKESDRL AAVAAELSKM GITVVRTDTG
LEITGGRPHG AVIETYNDHR MAMSFAVAGL VTPGVTIANE GCVAKSFPGF WQVFEGLYSS
GIS