Gene Dole_1401 details


Gene Information

Locus tag: Dole_1401
Symbol:
ID: 5694236
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1665411
End bp: 1666589
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 58%
IMG OID: 641263994
Product: cystathionine gamma-synthase
Protein accession: YP_001529282
Protein GI: 158521412
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID:
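
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = feature_length(1665411, 1666589)
print(gene_len)                  # 1179
print(protein_length(gene_len))  # 392
```

Both values match the Gene Length and Protein Length fields of this record.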


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGGG ACAAAGACTG GGGAATCTCA ACCAAGGCGG TGCATGCCGG TGAGATCCGG 
TATAACGAAT ACGGGTCGGT GACCACGCCC ATTGTGCAGA CATCCACGTT TATCTTCAAG
AATATCGACG AGATAAAGAA GCTGGCCGTG GGAGCGGTGG AACGGTTTGA ATACGGCCGG
TACGGCCATC CCACCCAGAT CGCCGCGGAA CACAAGCTGG CCTTTCTGGA AGGGGCAGAG
GACGCGGTGC TGTTTTCTTC GGGCATGAGC GCCATCACCA CCACCCTGTT CGGCCTGCTC
AAGTCCGGGG ACCACATCAT CATCACCGAC GACGCCTACC GCCGCACCCT GGAGTTCTGC
AAGGCCTGCC TGGTCAAGTT CGACATCGAG TGCACGGTGG TGAAAATGTG CGATTACGAG
GCCATGGAAA AGGCGATCAA ACCCAACACC CGGCTCTTTT TCTCCGAGTC ACCCACCAAC
CCCTACCTCA ATATCATGGA CCTGGAACGG TTGATCGGTA TTTTTAAAGA CAAGGGCATT
CTGGTGGTGT CGGACAGCAC CTTTGCCACG CCGTATAACC AGAAACCCCT GGAGTACGGG
GTGGATATCG TGATTCACAG CGCCACCAAG TACCTGGCCG GTCACAACGA CCTGCTCAGC
GGCGTGGTGC TGGGCAGCAA GAAGCTGGTG GAGCCGGTTC GGGAATTCCT CAAAATCACC
GGCGGGGTGA TTGATCCCAA TTCGGCCTAC CTGCTGATCC GGGGCCTGAA GACCTTTGGC
CTGCGCATGG AGCGGCTCAA TGAAAACGGC CAAATCGTTG CCGAAGGACT GGAACGGCAT
CCCAAGATCA GCCGGGTTTA CTATCCCGGC CTGCCCAGCC ATCCCCACCA TGACGTGGCA
AAGGCCCAGA TGAAGGGGTT TGGGGCTGTG GTGACCTTTG AGGTGGAAGG CGACTGCGAG
TATGTGCTCA ATTTTTTGAG CCGGCTCAAG ATCATCAACA TCGGCCCCAG CCTGGGCGGA
GTGGAGTCGC TGATCACCCA CCCGGCCACC ATCAGTTACT ACGATAAAAC CCGGAAGGAA
CGTCTGGCCC TGGGCATCAA GGACGGCCTG ATCCGCCTGG CCGTGGGCGT GGAAAACGCC
GAGGATATTA TTGCCGATAT CGAGCAGGCA CTGGCGTAA
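
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as a string in the 10-base block layout shown above. `gc_fraction` is a hypothetical helper, and only the first 30 bases are used here for brevity, so the printed value is not expected to equal the whole-gene figure.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three 10-base blocks of the gene sequence, copied from the record.
first_blocks = "ATGAAAGGGG ACAAAGACTG GGGAATCTCA"
print(f"{gc_fraction(first_blocks):.0%}")
```

Passing the complete gene sequence instead of `first_blocks` gives a value that can be compared with the GC content field.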
 
Protein sequence
MKGDKDWGIS TKAVHAGEIR YNEYGSVTTP IVQTSTFIFK NIDEIKKLAV GAVERFEYGR 
YGHPTQIAAE HKLAFLEGAE DAVLFSSGMS AITTTLFGLL KSGDHIIITD DAYRRTLEFC
KACLVKFDIE CTVVKMCDYE AMEKAIKPNT RLFFSESPTN PYLNIMDLER LIGIFKDKGI
LVVSDSTFAT PYNQKPLEYG VDIVIHSATK YLAGHNDLLS GVVLGSKKLV EPVREFLKIT
GGVIDPNSAY LLIRGLKTFG LRMERLNENG QIVAEGLERH PKISRVYYPG LPSHPHHDVA
KAQMKGFGAV VTFEVEGDCE YVLNFLSRLK IINIGPSLGG VESLITHPAT ISYYDKTRKE
RLALGIKDGL IRLAVGVENA EDIIADIEQA LA
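
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact amino-acid string used for NCBI translation table 11, whose codon-to-amino-acid assignments are the same as the standard code. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene starts with ATG. `translate` and the constant names are hypothetical.

```python
from itertools import product

# Codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...) paired with their
# amino acids; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAAAGGGG ACAAAGACTG GGGAATCTCA"))  # MKGDKDWGIS
```

The output matches the first block of the protein sequence above. Translating the full gene sequence should reproduce all 392 residues if the record is internally consistent.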