Gene Dole_0942 details

Gene Information

Locus tag: Dole_0942
Symbol:
ID: 5693777
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1098694
End bp: 1100559
Gene Length: 1866 bp
Protein Length: 621 aa
Translation table: 11
GC content: 58%
IMG OID: 641263539
Product: extracellular solute-binding protein
Protein accession: YP_001528829
Protein GI: 158520959
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGAAAA TTAAGTATCT GATATCGGCG CTGATGGTCC TGTGCATGCT CCTCGCCTGG 
GGCTGTTCTT CAGACAAAGA TGAGCAGCAG GCCGCACCGG AGCCGGCCCA TTCACCCTTC
CGGGCCCCTC TGGCCGACAC CGATCTTCCG GAAAATATAG ACTGGCTGAC CAACGACACC
GACCCGGTGT TTGCCTCTCC GGACGCGGTC ACCGGCGGCA CCCTGCGACT GTCCCTGCTG
AGTTTTCCCT TAACCTTCCG GGTGGTGGGG CCGGACTCCA ACAGCAGCTT TCGCAGCGCC
ATCCTGGGCA ACCAGCTCTC CCTGATCAGC ATTCATCCCA ACACCGAGCG CATTGTACCG
GAAATCGCCA CCCACTGGGC ATTTGGCAAA GACAAAAAAA CCATGTACTT CAGGCTCAAC
CCTGATGCCC GGTGGTCCGA CGGCATGCCG GTGACGGCCC ACGACTTTGC CTACACCCTG
GCGTTCATGC GGTCCCCGCA CATCGTGGCG CCCTGGTACA ACGACTACTA TACAAAAGAA
ATAGAAAAGG TCGTGGTGTA TGACGACCAC ACCCTGGCCG TGGTGGCGTC CAAACCCGAC
CCCGACCTTT ACCTGAAGGT GGGCATCTCC CCCACACCCA GCCACTTTTA CGGCGACCTG
GACGAAAATT TCGTGCGGGA CTACAACTGG AAAATCGCCC CCAACACCGG GCCTTACCAG
GTATCCGGGT TTGAAAAGGG AAAGTTTGTG GAATTTTCCC GAAAGCCCGA CTGGTGGGCC
TCTGATCTGC GTTTCTTTAA AAACCGGTTC AACGTGGACA AGGTAGTCTA CACCGTTGTC
AAGGACTTCA ACATGTCCTG GGAGTACTTC AAGCAGGCCG AGATCGACGC GTTTCCCCTG
ACCATGCCCG ACTTCTGGCA CGACAAAACG CAAACCCCGG TCTTTGACAA GGGGTATGTG
CATCGCATCT GGTTTTTCAA CGACACCCAG CAGTCGGCCA TGGGCATGTG GCTCAACCAG
GATATCGATA TCTTCAAGGA TGTCCGGGTA CGTTACGCCT TTGCCCATGC CATGAACATT
CAAAAGGTCA TCGACACCGT GCTGCGGGGC GACTACTTTC GCCTGGAGCA GGGATTCGTG
GGATACGGCG ACTACACCAA CCCCGATGTC AAGGCCCGGC GGTTCGACCT TGAAAAGGTC
AATGACTACA TGACCGGTGC GGGGTGGCAG CGGGGGACGG ACGGCATCTG GGAAAAAGAG
GGCATGCGGT TTTCCGTGGA TGTGACCTAC AGCCATGATG GCCACACCCC GCGGCTGGTG
GTGTTAAAAG AGGAGGCCCG GAAGGCGGGC ATCGAACTCA ACCTGCGCCG GCTGGACCCT
TCGGCGTCCT ACAAGCAGGT GCTGGAGAAA AAGCACGAGG TGGGGTGGGT GGCATGGAGC
ACGTCCCTGA GGCCCCAGTA CTGGGAGCAT TTTCATTCGG AAAACGCCCA CAAGCCCCAG
ACCAACAACA TCACCAACAC CGACGACCCG GAGATGGACC GGCTGATCGA GGCCTACCGC
GCCAGCCTGG ACGCGGAGGA ACGAAAGGTT TTATCCCGGC AGATTCAGGC GAAAATTCAC
GAGATCGGCG CCTTTGTGCC CACCTTCATG GTGCCCTATG TGCGCCAGGC CTGCTGGCGA
TGGTGGCGAC TGCCCGACCC GCCGGGCACC CGGCACACCG ATGACCTGTT CGAACCCTTT
TCCAGCGGCA CCGGCGGCCT GTTCTGGTTT GACCGGAAAC TGTACGACGA GACTCAGGCG
GCCATGAAAG CCGGCACCGC CTTTGACCCG GTGACCATCG AGGACAGAAC ATTTTATCGG
CAATAA
 
Protein sequence
MPKIKYLISA LMVLCMLLAW GCSSDKDEQQ AAPEPAHSPF RAPLADTDLP ENIDWLTNDT 
DPVFASPDAV TGGTLRLSLL SFPLTFRVVG PDSNSSFRSA ILGNQLSLIS IHPNTERIVP
EIATHWAFGK DKKTMYFRLN PDARWSDGMP VTAHDFAYTL AFMRSPHIVA PWYNDYYTKE
IEKVVVYDDH TLAVVASKPD PDLYLKVGIS PTPSHFYGDL DENFVRDYNW KIAPNTGPYQ
VSGFEKGKFV EFSRKPDWWA SDLRFFKNRF NVDKVVYTVV KDFNMSWEYF KQAEIDAFPL
TMPDFWHDKT QTPVFDKGYV HRIWFFNDTQ QSAMGMWLNQ DIDIFKDVRV RYAFAHAMNI
QKVIDTVLRG DYFRLEQGFV GYGDYTNPDV KARRFDLEKV NDYMTGAGWQ RGTDGIWEKE
GMRFSVDVTY SHDGHTPRLV VLKEEARKAG IELNLRRLDP SASYKQVLEK KHEVGWVAWS
TSLRPQYWEH FHSENAHKPQ TNNITNTDDP EMDRLIEAYR ASLDAEERKV LSRQIQAKIH
EIGAFVPTFM VPYVRQACWR WWRLPDPPGT RHTDDLFEPF SSGTGGLFWF DRKLYDETQA
AMKAGTAFDP VTIEDRTFYR Q