Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dole_0942 |
Symbol | |
ID | 5693777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Desulfococcus oleovorans Hxd3 |
Kingdom | Bacteria |
Replicon accession | NC_009943 |
Strand | - |
Start bp | 1098694 |
End bp | 1100559 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641263539 |
Product | extracellular solute-binding protein |
Protein accession | YP_001528829 |
Protein GI | 158520959 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGAAAA TTAAGTATCT GATATCGGCG CTGATGGTCC TGTGCATGCT CCTCGCCTGG GGCTGTTCTT CAGACAAAGA TGAGCAGCAG GCCGCACCGG AGCCGGCCCA TTCACCCTTC CGGGCCCCTC TGGCCGACAC CGATCTTCCG GAAAATATAG ACTGGCTGAC CAACGACACC GACCCGGTGT TTGCCTCTCC GGACGCGGTC ACCGGCGGCA CCCTGCGACT GTCCCTGCTG AGTTTTCCCT TAACCTTCCG GGTGGTGGGG CCGGACTCCA ACAGCAGCTT TCGCAGCGCC ATCCTGGGCA ACCAGCTCTC CCTGATCAGC ATTCATCCCA ACACCGAGCG CATTGTACCG GAAATCGCCA CCCACTGGGC ATTTGGCAAA GACAAAAAAA CCATGTACTT CAGGCTCAAC CCTGATGCCC GGTGGTCCGA CGGCATGCCG GTGACGGCCC ACGACTTTGC CTACACCCTG GCGTTCATGC GGTCCCCGCA CATCGTGGCG CCCTGGTACA ACGACTACTA TACAAAAGAA ATAGAAAAGG TCGTGGTGTA TGACGACCAC ACCCTGGCCG TGGTGGCGTC CAAACCCGAC CCCGACCTTT ACCTGAAGGT GGGCATCTCC CCCACACCCA GCCACTTTTA CGGCGACCTG GACGAAAATT TCGTGCGGGA CTACAACTGG AAAATCGCCC CCAACACCGG GCCTTACCAG GTATCCGGGT TTGAAAAGGG AAAGTTTGTG GAATTTTCCC GAAAGCCCGA CTGGTGGGCC TCTGATCTGC GTTTCTTTAA AAACCGGTTC AACGTGGACA AGGTAGTCTA CACCGTTGTC AAGGACTTCA ACATGTCCTG GGAGTACTTC AAGCAGGCCG AGATCGACGC GTTTCCCCTG ACCATGCCCG ACTTCTGGCA CGACAAAACG CAAACCCCGG TCTTTGACAA GGGGTATGTG CATCGCATCT GGTTTTTCAA CGACACCCAG CAGTCGGCCA TGGGCATGTG GCTCAACCAG GATATCGATA TCTTCAAGGA TGTCCGGGTA CGTTACGCCT TTGCCCATGC CATGAACATT CAAAAGGTCA TCGACACCGT GCTGCGGGGC GACTACTTTC GCCTGGAGCA GGGATTCGTG GGATACGGCG ACTACACCAA CCCCGATGTC AAGGCCCGGC GGTTCGACCT TGAAAAGGTC AATGACTACA TGACCGGTGC GGGGTGGCAG CGGGGGACGG ACGGCATCTG GGAAAAAGAG GGCATGCGGT TTTCCGTGGA TGTGACCTAC AGCCATGATG GCCACACCCC GCGGCTGGTG GTGTTAAAAG AGGAGGCCCG GAAGGCGGGC ATCGAACTCA ACCTGCGCCG GCTGGACCCT TCGGCGTCCT ACAAGCAGGT GCTGGAGAAA AAGCACGAGG TGGGGTGGGT GGCATGGAGC ACGTCCCTGA GGCCCCAGTA CTGGGAGCAT TTTCATTCGG AAAACGCCCA CAAGCCCCAG ACCAACAACA TCACCAACAC CGACGACCCG GAGATGGACC GGCTGATCGA GGCCTACCGC GCCAGCCTGG ACGCGGAGGA ACGAAAGGTT TTATCCCGGC AGATTCAGGC GAAAATTCAC GAGATCGGCG CCTTTGTGCC CACCTTCATG GTGCCCTATG TGCGCCAGGC CTGCTGGCGA TGGTGGCGAC TGCCCGACCC GCCGGGCACC CGGCACACCG ATGACCTGTT CGAACCCTTT TCCAGCGGCA CCGGCGGCCT GTTCTGGTTT GACCGGAAAC TGTACGACGA GACTCAGGCG GCCATGAAAG CCGGCACCGC CTTTGACCCG GTGACCATCG AGGACAGAAC ATTTTATCGG CAATAA
|
Protein sequence | MPKIKYLISA LMVLCMLLAW GCSSDKDEQQ AAPEPAHSPF RAPLADTDLP ENIDWLTNDT DPVFASPDAV TGGTLRLSLL SFPLTFRVVG PDSNSSFRSA ILGNQLSLIS IHPNTERIVP EIATHWAFGK DKKTMYFRLN PDARWSDGMP VTAHDFAYTL AFMRSPHIVA PWYNDYYTKE IEKVVVYDDH TLAVVASKPD PDLYLKVGIS PTPSHFYGDL DENFVRDYNW KIAPNTGPYQ VSGFEKGKFV EFSRKPDWWA SDLRFFKNRF NVDKVVYTVV KDFNMSWEYF KQAEIDAFPL TMPDFWHDKT QTPVFDKGYV HRIWFFNDTQ QSAMGMWLNQ DIDIFKDVRV RYAFAHAMNI QKVIDTVLRG DYFRLEQGFV GYGDYTNPDV KARRFDLEKV NDYMTGAGWQ RGTDGIWEKE GMRFSVDVTY SHDGHTPRLV VLKEEARKAG IELNLRRLDP SASYKQVLEK KHEVGWVAWS TSLRPQYWEH FHSENAHKPQ TNNITNTDDP EMDRLIEAYR ASLDAEERKV LSRQIQAKIH EIGAFVPTFM VPYVRQACWR WWRLPDPPGT RHTDDLFEPF SSGTGGLFWF DRKLYDETQA AMKAGTAFDP VTIEDRTFYR Q
|
| |