Gene Dole_2849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2849 
Symbol 
ID5695707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3433011 
End bp3434222 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content55% 
IMG OID641265464 
Producttype II secretion system protein 
Protein accessionYP_001530729 
Protein GI158522859 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1459] Type II secretory pathway, component PulF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCTTT ATCTCTGGGA AGGAAAGCGA AAAGACCAGT CGACCCAGAA GGGCGAGATG 
GAGGCCCCCA GCGAGGAGGC GGTCCGTGCC CAGCTTCAGC GGCAGAGAAT CACGCCGATA
AAAGTCAAGA AAAAACCAAA AGATCTGTTT GAAAACGTCG CCTTTCTTCA GCCCAGGGTC
AAGGAAGAGA ATGTTATTAT TTTTGCCCGC CAGTTCTCCA CCATGATCGA TGCCGGTCTG
CCCATTGTTC AGTGTCTGGA TATTCTTTTT TCCCAGGAGG ACAACCCCAC CTTCAGAAAG
ATTCTTAAAA GTATAAAGGA CAGCGTGGAA GGGGGCCAGA CCCTGGCCGA CGCCATGAAG
CCGTTTCCCG ATCAGTTTGA CAATCTGTTT GTCAACATGA TCGCCGCCGG TGAGGCCGGT
GGTATTCTGG ATGTCATTCT GAAGCGGCTC TCCGCTTACA TGGAAAAGGC GGCCAAGCTC
AAGCGGCAGG TCAAGGGTGC CATGATGTAC CCTCTGATCA CCCTGTCCAT TGCTGTTCTG
GTGGTGATCG TGATTCTGGT GTTCGTCATT CCGGTGTTTC AGGAGATGTT TGCCGATTTC
GGCAAAGAGC TGCCCGCCCC CACCCAGTTT GTCATTTTTC TTTCCGATAC GGTGAAATCC
CAGATTCATT TTATCATCAT CGCCTTTGTA CTGCTTGTCC TGGCCTACCG CCAGTTTTAC
AAGACGGAAA AGGGTCGCCT GTTCATGGAC GGCCTTTTTC TCAAGTTCCC CATTGTAGGT
ATTCTGCTCC GCAAGGTGGC GGTTGCCCGA TTCACTCGCA CCATGGGTAC CATGCTGGCC
AGCGGCGTCT CCATTCTGGA TGCTCTGGAT ATCGTGGCCC GCACCGCCGG CAACAAGGTG
GTGGAAAACG CGGTCTTCAA GGTCCGTTCC GGCATTTCCG AAGGCCGCAC CATGGCCGAT
CCCCTTCTGG AGAGCGGGGT CTTTCCGTCA ATGGTGTGCC AGATGATTTC CGTTGGCGAG
TCCACCGGCG CTCTGGATGC CATGCTGGAA AAGATCGCCG ATTTTTATGA AGAAGAGGTG
GATGCCGCCG TGGAAAGCCT GACCTCTGCC ATTGAGCCGC TGATGATGGT CTTCATGGGC
GGATTGATCG GCGGCCTGGT GGTCTCCATG TATCTGCCCG TCTTCCAACT GGCCGGCGCG
GTATCCGGCT AG
 
Protein sequence
MALYLWEGKR KDQSTQKGEM EAPSEEAVRA QLQRQRITPI KVKKKPKDLF ENVAFLQPRV 
KEENVIIFAR QFSTMIDAGL PIVQCLDILF SQEDNPTFRK ILKSIKDSVE GGQTLADAMK
PFPDQFDNLF VNMIAAGEAG GILDVILKRL SAYMEKAAKL KRQVKGAMMY PLITLSIAVL
VVIVILVFVI PVFQEMFADF GKELPAPTQF VIFLSDTVKS QIHFIIIAFV LLVLAYRQFY
KTEKGRLFMD GLFLKFPIVG ILLRKVAVAR FTRTMGTMLA SGVSILDALD IVARTAGNKV
VENAVFKVRS GISEGRTMAD PLLESGVFPS MVCQMISVGE STGALDAMLE KIADFYEEEV
DAAVESLTSA IEPLMMVFMG GLIGGLVVSM YLPVFQLAGA VSG