Gene Dole_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2110 
Symbol 
ID5694953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2562757 
End bp2563755 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content60% 
IMG OID641264711 
ProductKpsF/GutQ family protein 
Protein accessionYP_001529991 
Protein GI158522121 
COG category[M] Cell wall/membrane/envelope biogenesis
[T] Signal transduction mechanisms 
COG ID[COG0794] Predicted sugar phosphate isomerase involved in capsule formation
[COG2905] Predicted signal-transduction protein containing cAMP-binding and CBS domains 
TIGRFAM ID[TIGR00393] KpsF/GutQ family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000613072 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGA AAAAAGAGAT CACGGACATC ACTCAGCAGG CCATTGATGT TCTGAAGAAT 
GAGGCAAAAG GAATTCTGGA GGTGGCCGCC AACCTGGACC ACCAGTTTGA AAAGGCGGTG
GACCTGATCT GCCGGTCAAA AGGCCGGCTG GTTGTCAGCG GCATCGGCAA ATCGGGCATC
GTGGGCCAGA AAATCGTGGC CACCCTCAAC AGCACCGGCA CCCGTGCCCT GTTTCTCCAT
CCGGTGGAGG CCATGCACGG CGACCTGGGC ATCGTGGGGC CGAAGGACGT CTTTCTGGGC
CTCTCTAACA GCGGAGAGAC CGAAGAACTT ACCGGCCTGA TTCCCACAAT CCGCAACGTG
GGCTGCAGGG TAATCGCCTT TACCGGCAAT ACCCACTCCT CCCTGGCCCG GCAGAGCGAT
ATCGTAATTA ATGTGGGCGT GAAAAAAGAG GCCTGCCCCC TGGGACTGGC CCCCACTACC
AGCACCACGG CCCTTATGGC CATGGGCGAC GCCCTGGCCG TGTCCTTGAG CATCAGAAAA
GACTTCAAGT CCAGTGATTT CCAGCGGTTC CACCCCGGCG GCTCCCTGGG CCGGCGCCTG
GCCCTCAACG TATCGGAGAT CATGCTCACC GGTGACAGGG TGCCCGCGGT TCCGGTCAAA
ACCCCCATTG AGGAGGCCCT GGCCGTCCTG GACCGTCAGA ACCTGGGGGC ACTGCTGGTG
GTCAGAAAAA ACAACACCCT GGCAGGCATT CTGACAGACG GTGACCTTCG GCGGTTGTAT
CTGGCAAAAG AACCCCTGTC GGGCGGCCCC GTGGACAGCA TAATGACGAA AAACCCTTTG
ACCGTCCATC CGGACTCCCC GGTCTACGAC GCACTGAACA TCCTGGAGCA GCACCAGGTC
ACGGCATTGC CGGTGACCGC CGCCGGCAAA AAGGTGTGTG GCATTCTGCA CCTGCACGAC
ATCCTGGGCA AAGGGGCGTT CAAGTTCAAC GGCCGGTAA
 
Protein sequence
MKQKKEITDI TQQAIDVLKN EAKGILEVAA NLDHQFEKAV DLICRSKGRL VVSGIGKSGI 
VGQKIVATLN STGTRALFLH PVEAMHGDLG IVGPKDVFLG LSNSGETEEL TGLIPTIRNV
GCRVIAFTGN THSSLARQSD IVINVGVKKE ACPLGLAPTT STTALMAMGD ALAVSLSIRK
DFKSSDFQRF HPGGSLGRRL ALNVSEIMLT GDRVPAVPVK TPIEEALAVL DRQNLGALLV
VRKNNTLAGI LTDGDLRRLY LAKEPLSGGP VDSIMTKNPL TVHPDSPVYD ALNILEQHQV
TALPVTAAGK KVCGILHLHD ILGKGAFKFN GR