Gene Dole_1800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1800 
Symbol 
ID5694640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2179939 
End bp2181285 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content47% 
IMG OID641264398 
Productputative polysaccharide biosynthesis protein 
Protein accessionYP_001529681 
Protein GI158521811 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCCCAC CGTTGATAAT AGGCAGGTTG CTCCATCCAA TCAGGGGTCG TTTCAATAGG 
TGGCACAGGG AGATATCCCG AATCGCAGGG GAACTGAGAA AAAGCGAATT TGCATCACTG
GATGCGCTTT ACAATATCCA GCGGCAGAAA CTTTGCCGCC TCTGCGAAAA TGCAGCTAAA
ACATCGCCCT ATTTCAAAAA CCTTTTTGTC CAATCTGGAC TTGACCCACA AAAAATCGAC
CTGGACACAC TTGCGAACCT GCCGGTGCTG GAAAGGGCTA CCGTCCACCG CCATTGCAAG
GATATGCTAA ACCGGGAGTA TCCAGTAAAC AAAATCACCC AAAACGCCAC TGGTGGCTCC
AGTGGGGCCC CTTTGCGTTT CTATCGCTCA CATCATGACC TTATTTTTGC CGCTGCTATC
CGTGTAAGAG AAATGAACTG GTGCGGCATG AAACCGGGAT ACCCCCATGT AAAATTGTGG
GGTGCGCCCA CGGATGTTCA ATCCGCAACT GCAGGGATAA AGGCAAAAAT ATGGGGCTAC
CTGTATAATC AAAAAACCAT CAATGCATTT GATGCCGGCC CGGCCCTTTT TGAAAGAGAG
CATGGCCTGT TTTTAAGCAA CCCACCTTTC CTGCTGGAAT CTTATTCCAA CATTCTTTAT
GAATTTGCCA GTTACTTAAA AAAGAGCGGG AAAGCGCCGT TGAACCTTCC GGCGGCGATA
TCATCCGCCG GCGTGCTTTA TGATTTCCAG CGGACTGTTA TTCAGGATAC AGTTTCCCGG
AATCTTTACA ACCGATACGG ATGTCGGGAA ATGGGTAATA TTGCCCACGA ATGTGCCAAC
CATTCCGGCC TTCATGTTCA CATGGAGCGC CACATTATCG AAATCATTAA CCCCGATGAG
GATGGTGTAG GGGATATTCT GGTGACAGAC CTGGAAAATC TTGCTTTCCC TTTTATCCGC
TATAGAATTG GAGACAGGGG AAAATTTTCA ATCCAAAAAT GTGCCTGCGG CAGAAATTTA
TTGTTGCTCA AGGAGATTGT CGGCAGAACT CTGGATATTA TTCGAACACC TTCCGGTAGA
TTGATTCCAG GCGAACTGTT TCCCCATTTT TTCAAAGATT ATCCCCAGAT CACCTTGGGG
CAGGTCATTC AGGACCGTAT CGATCATATC GAGCTCCGTT TACGGCTTCA GGAGGGCGCC
GTCCTGGATG ATATCGAGCC ATTGCTCCGG AAAATTAACG AAGCATGCAA TAATGAAGTC
ACCATCACGG TTAATATGGA AGAGGATTTC GTCGTCAATC CGACGGGTAA ATACCGGCCA
GTAATTTCAC ACCTGGAAAG TCAGTAA
 
Protein sequence
MFPPLIIGRL LHPIRGRFNR WHREISRIAG ELRKSEFASL DALYNIQRQK LCRLCENAAK 
TSPYFKNLFV QSGLDPQKID LDTLANLPVL ERATVHRHCK DMLNREYPVN KITQNATGGS
SGAPLRFYRS HHDLIFAAAI RVREMNWCGM KPGYPHVKLW GAPTDVQSAT AGIKAKIWGY
LYNQKTINAF DAGPALFERE HGLFLSNPPF LLESYSNILY EFASYLKKSG KAPLNLPAAI
SSAGVLYDFQ RTVIQDTVSR NLYNRYGCRE MGNIAHECAN HSGLHVHMER HIIEIINPDE
DGVGDILVTD LENLAFPFIR YRIGDRGKFS IQKCACGRNL LLLKEIVGRT LDIIRTPSGR
LIPGELFPHF FKDYPQITLG QVIQDRIDHI ELRLRLQEGA VLDDIEPLLR KINEACNNEV
TITVNMEEDF VVNPTGKYRP VISHLESQ