Gene Dole_3233 details

Gene Information

Locus tag: Dole_3233
Symbol:
ID: 5696096
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 3876779
End bp: 3877813
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 61%
IMG OID: 641265853
Product: ApbE family lipoprotein
Protein accession: YP_001531113
Protein GI: 158523243
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
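
The reported lengths follow from the coordinates. A minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the 1035 bp figure) and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the record:

```python
# Coordinates copied from the record; helper names are illustrative only.
START_BP = 3876779
END_BP = 3877813


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1035 344
```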


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000000188784
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGGC GTTTAACAAA ATATTCAGGC CGGTGGATGC TGGCCGCGGC CTGCCTCTGC 
CTGGCTTTTG CCGGGTGCGA CGGCTCACGG CACAAGACCT TTTCCGGAAA AACCATGGGC
ACCGAGTACC ATGTCACGGT GGTGACCGGA ATGCTGTCAC GCACCGCGCC CCTGCAAAAG
AAGGTCGAGG CCCGGCTGGC CCACATCAAC GCCGGTATGT CCACGTATAT GGACACCAGT
GAGATTTCCC GGTTTAACAA CGAGATCGGC CAGGACCAGC CCTTTGCCGT GTCCAAAGAT
TTTTTGCGGG TGGCTGCCGA AGGCATGGCC CTGTTTCGGC TGACCGATGG GGCCTGGGAC
GGGTCGGTGT GGCCCCTGAT GATCCTGTGG GGGTTTGACC GGCCGGAGCA GCAGCGTTTT
GTACCGGATT CGGCCGAAAT CGACCAGGTG CTGACCTGCG TGGGATACGA TTCGCTTCAG
ATTGATGAGG CAAACCGCCT GGTGAAAAAA ACGCCCTGCC TGTTTCTGGA CTTTGCCTCC
ATTGCCAAGG GGTATGGCGT GGATGTCGTG GCCGAGGTGC TTCGGGAGGC CGGTGTCGAC
AATTTTATCG TGGAAGTCGG GGGCGAAGTG TATGCCGCCG GTGTACGGGA AACCGGGGAT
CCCTGGCGTA TCGGCATCAA CACGCCTGAA CCGGGTGCGC CGGTGGACCG GGTGCGCCAG
GTGGTGGCCC TGTCCGACCG GGCCATGGCC ACCAGTGGTG ACTACCGGAA CTATTTTGTG
ATCGACGATC GGACCTACAG CCATGTGCTG GACCCCCGGA CCGGTTATCC CGTGGCCAAT
GGTGTGGTCA GCGCCACTGT GGTGGCCGAC ACCTGCACCT TTGCCGACGG ACTGGCCACA
GCCCTGATGG TGATGGGGGC CGAACCCGGA ACGGCCCTGG TAAACACCCT GGAAAACGTG
GAGAGCTGCA TCACGGTCCG CCGGACCGAC GGCACGTACG AGGATTTTTG GTCAACCGGA
TTTGTCGCGC AGTAA
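
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. A minimal sketch of how a GC fraction like the one reported in the record could be computed; the function names are hypothetical, and only the first line of the sequence is used here for brevity:

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as displayed in the record.
first_line = "ATGATGAGGC GTTTAACAAA ATATTCAGGC CGGTGGATGC TGGCCGCGGC CTGCCTCTGC"
bases = clean_sequence(first_line)
print(len(bases), round(gc_fraction(bases), 2))
```

Running the same two functions over all lines of the gene sequence gives the whole-gene value to compare against the GC content field.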
 
Protein sequence
MMRRLTKYSG RWMLAAACLC LAFAGCDGSR HKTFSGKTMG TEYHVTVVTG MLSRTAPLQK 
KVEARLAHIN AGMSTYMDTS EISRFNNEIG QDQPFAVSKD FLRVAAEGMA LFRLTDGAWD
GSVWPLMILW GFDRPEQQRF VPDSAEIDQV LTCVGYDSLQ IDEANRLVKK TPCLFLDFAS
IAKGYGVDVV AEVLREAGVD NFIVEVGGEV YAAGVRETGD PWRIGINTPE PGAPVDRVRQ
VVALSDRAMA TSGDYRNYFV IDDRTYSHVL DPRTGYPVAN GVVSATVVAD TCTFADGLAT
ALMVMGAEPG TALVNTLENV ESCITVRRTD GTYEDFWSTG FVAQ
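
The protein sequence can be checked against the gene sequence by translating it codon by codon. A minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two tables differ in the start codons they permit, which this sketch ignores); the names used are hypothetical:

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate("ATGATGAGGC GTTTAACAAA ATATTCAGGC"))  # prints: MMRRLTKYSG
```

The output matches the first ten residues of the protein sequence listed above; passing the full gene sequence allows the same comparison over all 344 residues.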