Gene Dole_1145 details

Gene Information

Locus tag: Dole_1145
Symbol:
ID: 5693979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 1361474
End bp: 1363693
Gene Length: 2220 bp
Protein Length: 739 aa
Translation table: 11
GC content: 52%
IMG OID: 641263738
Product: glycoside hydrolase family protein
Protein accession: YP_001529028
Protein GI: 158521158
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
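
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 2220 bp) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 1361474
END_BP = 1363693


def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))  # 2220 739
```

Both results agree with the Gene Length and Protein Length fields of this record.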


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAATTTT TCATGACAAT CGTTTCAGTT TTTTTTAGCC TTTCCATGGT TGCGCAGCCA 
CTGTTCGCCA TGGCGCCGGC CCCGCCGCCG CCGTCTTTAA GATCGCAGTA TGTGGTTTCC
GGCAATAACG TGTCTGCTGT AGTAGAAACC CGGCCTTTTC GCCTGCTCAT CAAGGATGGC
GGCGGGCGAA CCGTACTGGC TTCGATCAAT AGCAACACCC CGATTTTCAA TCCGGATTGT
AATTACTGCG AACCCGACCT GATGGGAGAG TTCTGGTCTT TTCCACCGAT TATCTGTGAC
AGCTATCACC CGTTGCACTT TGAAATCGGC GGGCCTGAAA CCTTTGACTT TATCGATCTG
CTTGTCGTCG AGCCGATGCA GGTCTATCGC GACGCCAAAC TCTGTTTTGC CACGGACGTG
ATCAAGGTGA GTGATTCCGG TAACGGGAAA AAATTCAAAT TAAAAACCAC AAAAGACAAC
ACAACAGTTG ATGTCCTTGT CGAACCGGAC CCGTCGGGCG TGAAAGCCAT ACGGATCTCA
GCCACGGTAA ATGACCCGCG CGCCCAAAAC ATCAGTTTCG CTTTTACCAG TCATCACAAT
GAATCCTTTT ATGGATTCGG GGGGCGTCGC GGCACACTTG ATCAAAAAGG TCATGCGCTC
TATTCATGGA CCATGGACGC CATGGCACAG AACGCTATAG TGGGCAAATA CTCTCTTTCC
CGGGCCTATG GACCGCAGGC GCTGTTCTAT TCTTCGGAAG ACTACGGTTT TCTGCTTGAA
AACAGTGAAC TGGCGCGGTT CTATATGGGA AATGATCGCG AAGATGCCTG GAAAGTCAAT
GTCTCTTCCA ACAGCGCCGC CTTTGTTGTG AGTGCCGGGG ACCACAAACA GAACATCGAA
AGTATAACGG CCATTAACGG CCGTCACCGC CCGATTCCCG ATTGGGCCAA AGGATTGATA
TTTGCCCAGC GATCGCCCAT AACCTTAATT GGTGACGCCG AACCCGGGGC CTATTTCAGA
GACGCCATGG CGTATTTACA GAAATTAACG GAACTGAATA TTGAGACAAG CGGATACCTT
ATCGAGGCAT GGGCATCTTC CGCCAATCTT ACCAAAGCAG AGCTGGACCT GTTGCTGGCA
GAGCTTAATC GTCTGGATAT CAAGCCATTA ACCTATATGC GGATGATGCT GACCGATGAT
TCTCTGAACA CGGAAGATCC GGAAATCTAT TACCAGGCCT ACGACAATGG TTATATGCCC
ACCCGGGCAG ACGGTTCACC CTATGCCTAC CCGGTACTTA TGGCACCCAC CAGTGTGACG
GATTATACCA ACCCGTTTGC GCTCGACTGG TGGGAACAGC GGGTTACCGG TATGCTGGAT
CTGGGCAGTG GCGGCTTTAT GTTTGACTTT GGCGAACAGG TCCGGCCTGA CATGCAGTTC
TATAATGGTG AAACCGGACG CAGCATGCAT AACAGGCTGC CGGTCCTGGG CAATAAAGAA
ACGGCCCGGA TTGTCGATGA CTACGAACAG GCCCATCCGG GGCGGGACAT TTTCTTTTTT
ACCCGCGCCA ATTACTCCGG GCGACCCGGT TCAGCGGCCT ACGAAAACGC CCAGTTTCTC
GGGGACAATA CCCAGTCCTG GGATGCCGGC ACCGGCCTTA AGTCGGTCTT ACCGGATATT
CTCAACCGGA GCCTGGGCGG TGCCTATAAT ACGACAACAG ATATCGGGGG GTATTGGGAT
CTCTACGGCG TTGCCGGTAA AGAACTCTTT ATCCGCTGGA CCCAGCTCGC AACCTTTGGG
TCCGTATTCA GGCTTCATAA TTCACCTTTC ACGCCGCTGA AAACACCCTG GTCCTATGAT
GATGAAACGG TACGGATTTT CAAGTCCGTT CTTGCGCAGA GAAAAAAAGC CATGCCGTAT
ATGAACACGC TCTGGGAAAC CGCGGCTGCC ACCGGACTGC CGCTGTGGCG GCCCATGTGG
CTGGAGTTCC CTGACGACGA TCGATTTCGA AACGAAATGG GGCAATTCAT GTTAGGCGAC
AAGGTTTTGG TTGCCCCGGT ATTGGACCGG GGGAAACGCA CCAAATCGGT AAAGTTGCCA
GAAGGGTGCT GGCAGTATAT AAATACAGGC AAGGTTTATC AGGGCGGGCA GACGGTTATC
GTGGGTGCCC CCCTGGATGT TTTGCCCTGT TTCTTCAGGT GCGGAGAGTC TCCTTTTTAA
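
The GC content field reported for this gene can be recomputed from the gene sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as shown above (blocks of ten bases separated by spaces). The function name is hypothetical, and the example runs only on the first two blocks, not the full gene.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used for grouping."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(bases.count(b) for b in "GC")
    return 100.0 * gc / len(bases)


# First two ten-base blocks of the gene sequence above.
print(gc_percent("ATGCAATTTT TCATGACAAT"))  # 25.0
```

Passing the complete gene sequence to the same function should give a value comparable to the GC content field.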
 
Protein sequence
MQFFMTIVSV FFSLSMVAQP LFAMAPAPPP PSLRSQYVVS GNNVSAVVET RPFRLLIKDG 
GGRTVLASIN SNTPIFNPDC NYCEPDLMGE FWSFPPIICD SYHPLHFEIG GPETFDFIDL
LVVEPMQVYR DAKLCFATDV IKVSDSGNGK KFKLKTTKDN TTVDVLVEPD PSGVKAIRIS
ATVNDPRAQN ISFAFTSHHN ESFYGFGGRR GTLDQKGHAL YSWTMDAMAQ NAIVGKYSLS
RAYGPQALFY SSEDYGFLLE NSELARFYMG NDREDAWKVN VSSNSAAFVV SAGDHKQNIE
SITAINGRHR PIPDWAKGLI FAQRSPITLI GDAEPGAYFR DAMAYLQKLT ELNIETSGYL
IEAWASSANL TKAELDLLLA ELNRLDIKPL TYMRMMLTDD SLNTEDPEIY YQAYDNGYMP
TRADGSPYAY PVLMAPTSVT DYTNPFALDW WEQRVTGMLD LGSGGFMFDF GEQVRPDMQF
YNGETGRSMH NRLPVLGNKE TARIVDDYEQ AHPGRDIFFF TRANYSGRPG SAAYENAQFL
GDNTQSWDAG TGLKSVLPDI LNRSLGGAYN TTTDIGGYWD LYGVAGKELF IRWTQLATFG
SVFRLHNSPF TPLKTPWSYD DETVRIFKSV LAQRKKAMPY MNTLWETAAA TGLPLWRPMW
LEFPDDDRFR NEMGQFMLGD KVLVAPVLDR GKRTKSVKLP EGCWQYINTG KVYQGGQTVI
VGAPLDVLPC FFRCGESPF
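
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows. It uses the standard codon assignments, which translation table 11 shares (table 11 differs mainly in which start codons it allows). All names in the code are illustrative, and the examples use only the first and last few codons of the gene.

```python
# Codon table built in TCAG order; the residue string follows that ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence.
print(translate("ATGCAATTTT TCATGACAAT CGTTTCAGTT"))  # MQFFMTIVSV
# Last four codons of the gene sequence, ending in the TAA stop.
print(translate("TCTCCTTTTTAA"))  # SPF
```

The two outputs match the first ten and last three residues of the protein sequence above. This sketch raises a `KeyError` on ambiguous bases such as N, which do not occur in this record.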