Gene Dole_3229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_3229 
Symbol 
ID5696092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3872216 
End bp3873397 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content66% 
IMG OID641265849 
Producthypothetical protein 
Protein accessionYP_001531109 
Protein GI158523239 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0121849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATC GGTGTGTCGT GTCTGTGTCG TGGCTGGTGG CCGTGTGTGG CATGGCTGTT 
GCGGGCCTGT TTTTTTCTCC GGCTTTGGCA AGGTCTGCCG CCGGTGGAGA CACTGCCGTG
ACCGTGAATG TGGTGGGCAC CGGCGTTGAG ACGCAGGATA CGGATGCGGC CGCGGCCAAG
CGGGAGGCCA TTGACAACGG GCTCTCTTTG GCCGTGGACG AGGTCATGCG CCGGGTGGTG
ACCCAGGAGA TTCTGGCGGC CAATTTTGCC GACCTGGACG CGGCGGCCCG GGCGGTTGAA
GGCCAGGCCA TCCTGACCTA CCAGGTGCTG GCCGATGCCC GGCGGGAGGA TGTGTGCCGG
GTCCTGGTCC GGGCCTCGGT GTCAACGGAC CAGTTGCGGC GGCAGGTGCT GCGGGCCGGC
GTGCTGCCGG ACAGGGAGAA TATGCCGGGT GTAATGCTGC TTCTGCTGGA TGCCGACAAT
GACGGCCCGG CCGTGGCTGC CAACCAGGCC ATGACCGACA GCCTTTTAAA AAGGGGGTTT
CGGCCGGTGG CCGGATCAGA AAGCCTGCTG GCCGGGGAAT CGGGCCAGGC CGGGGCGGAC
ATGGCGCCGG CCCGGTTCGT GTCGCTGGGC CGGGAAATGG GAGTGGATTT CGTGGTGACC
GGTCATGTGG CGGCCACTCC GCCGGTGCGC ACCGGCAGGG AGTCGAAAAC CGGATGGCAG
GGCACGATCA ATGCCCGGGT GGTGAGCACG GATACAGGCC GGGAGGTGTT TTCCCTGGCC
ACCGAGGTGA TGCCGCCTGA CGAAGAGACG CTCTTTTTTG ACAAGGCCAT GGTGCAGGCC
GCAGCCGGGG CCCGGGCCGC AGCCGGGCTG TCGCCCGCCA TGGCCGTGGC CTGGGACCGG
CAGCAGGTGC AGACCCGGAG TTTTGATGTC ACGGTGCGGG GCGTGGGCTA CCTTGCCCAG
CTGGGGTCGT TTCGGTCGGC CGTGGAGTCC CTTGCCCCTG TCAAGCGGGT CCAGATACGG
GAAATGAAAA TCGACGAGGC CCTGGTGGTG GTTCAGGCTA TGGGCGGGGC CGAGGCCCTG
GCCGGTGCCA TTGGCGCGAT CCGGGCCGAC GGGTTTTATG TTCAGGTGCT GGATGTGGTT
GAGAACCGCA TGACCGTTGA AATTATTTCA GACGGCCACT GA
 
Protein sequence
MKNRCVVSVS WLVAVCGMAV AGLFFSPALA RSAAGGDTAV TVNVVGTGVE TQDTDAAAAK 
REAIDNGLSL AVDEVMRRVV TQEILAANFA DLDAAARAVE GQAILTYQVL ADARREDVCR
VLVRASVSTD QLRRQVLRAG VLPDRENMPG VMLLLLDADN DGPAVAANQA MTDSLLKRGF
RPVAGSESLL AGESGQAGAD MAPARFVSLG REMGVDFVVT GHVAATPPVR TGRESKTGWQ
GTINARVVST DTGREVFSLA TEVMPPDEET LFFDKAMVQA AAGARAAAGL SPAMAVAWDR
QQVQTRSFDV TVRGVGYLAQ LGSFRSAVES LAPVKRVQIR EMKIDEALVV VQAMGGAEAL
AGAIGAIRAD GFYVQVLDVV ENRMTVEIIS DGH