Gene Dole_1997 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1997 
Symbol 
ID5694837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2418287 
End bp2419567 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content55% 
IMG OID641264595 
Producthypothetical protein 
Protein accessionYP_001529878 
Protein GI158522008 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000131605 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACTTT ACAGAAGAAG ATTTTGGGTT GTGGCCCTGG TTGCGCTTTT GGTATGCCCG 
GCCGGCGCCG TTGGCATGAC ACTGGAAGAC CTCCAGATCC ATGGTTTTAT CAGCCAGGGG
TTCCTCTATT CAACAGACGA CGCGGACTTT CTGGCAAAAG ACTCCCACAA GGGGACACTG
GAATTCAATG AAATGGCCAT CAACTTTTCC GCCACTCCCA CCGATGATCT TTCCGTGGGG
ATGCAACTGG CCGCCTTTGA CCTGGGCACG ATCGGTAACG ACGAGGTGAT GGTGGACTGG
GCCTTTGGCG ACTATTCGTT CCGTGACTAC CTGGGGCTAA GGGCCGGCAT TATTAAAATT
CCCCTTGGCC TCTATAATGA TGTGCGCAAG ATCGACATGG TGCGCACCAG CATTCTGCTG
CCAACAAGTG TCTATCCGGA ATGGTTCCGG GAGGCCTTTG CCCGGATTAA GGGGGTGGGG
CTTTATGGCA CCCTGCCCGG CAACATCTCC TATCAGGCGT TGTATGGCAC TGTGGATATC
CAGACGGACG GGGGCCTGTC CGACGGTCTG GAGTCCCTGA TGGAGGGTTT AGGGGGCATG
GACACCAACT ATACGGATAC AAACTGTGCC TATGCCGGCA AACTCCAGTG GGACGCGCCC
GTGGGTCTCA AGCTGGCCGC CAGCGTATAT ACGCTGGATG GTTTAGAGAC AAACATGAAC
AGCATCAATT ATATCGATCC GGCTCCATTG GGGCTGCCGC TTCCTGTATA TCTGCCCGTT
GCCATGGACG CCTACATGCG TTTTGAACCG ATCACCACCT GGGTGCTGTC CGCCGAATAC
ATAACCGACC GGCTCACCCT GGCCGCTGAG TACGCCGAAT ATGACCTTGA GTTTAACGTC
GACATCACGA CTAACCTGGA TCCGGCGTTC AGCGCGTTCA TGGGCATTCC TCCCCGGGTG
GGGGACAAAA CCACCATGCA GGGCTATTAT GGCAGTGCTT CCTACCGCGT GCTCGACAAC
CTGGAGGTCG GCACCTATTA TTCCGAGTTT TATTATGACA AGGATGACCA TGACGGCGGC
AAATATGCCG CCAAATACGG TTTACCGAAA TACAATTCAT GGCTCAAGGA CACCTGCCTG
TCGGCCCGTT ATGATATTTC ACCCAACTGG TGTGCCAAGA TCGAAGGCCA CCTCATGGAC
GGCACTTACC TGGCCCTGGG CGCCCCTGCC GGCGTCGATT CCTGGGAACT TTACGCGGCC
AAGCTGACTT ACAGCTTCTA G
 
Protein sequence
MGLYRRRFWV VALVALLVCP AGAVGMTLED LQIHGFISQG FLYSTDDADF LAKDSHKGTL 
EFNEMAINFS ATPTDDLSVG MQLAAFDLGT IGNDEVMVDW AFGDYSFRDY LGLRAGIIKI
PLGLYNDVRK IDMVRTSILL PTSVYPEWFR EAFARIKGVG LYGTLPGNIS YQALYGTVDI
QTDGGLSDGL ESLMEGLGGM DTNYTDTNCA YAGKLQWDAP VGLKLAASVY TLDGLETNMN
SINYIDPAPL GLPLPVYLPV AMDAYMRFEP ITTWVLSAEY ITDRLTLAAE YAEYDLEFNV
DITTNLDPAF SAFMGIPPRV GDKTTMQGYY GSASYRVLDN LEVGTYYSEF YYDKDDHDGG
KYAAKYGLPK YNSWLKDTCL SARYDISPNW CAKIEGHLMD GTYLALGAPA GVDSWELYAA
KLTYSF