Gene Dole_0589 details


Gene Information

Locus tag: Dole_0589
Symbol:
ID: 5693413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfococcus oleovorans Hxd3
Kingdom: Bacteria
Replicon accession: NC_009943
Strand:
Start bp: 659151
End bp: 660308
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 63%
IMG OID: 641263175
Product: hypothetical protein
Protein accession: YP_001528476
Protein GI: 158520606
COG category: [S] Function unknown
COG ID: [COG2006] Uncharacterized conserved protein
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000367442
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGGGC TACCGATGGA CGGGACAAAA GGAGCAAGAG TGAAAACCCA GGACCGCGTT 
CTGATAATGG ATGCCGCCTA CGAGGCCGAG GTGATGGCCG ACGTGGTGGA CCGGGTGTTT
GACACCTTTC CCCTGGACCT GGCCGGAAAA AACGTGCTGG TCAAGCCCAA CATCCTGTCC
GGCTATGCCC CGGAAAAGGC GGTGACCACC CACCCGGTGC TGGTGAGTGC CGTGGTGGAA
AAGTTGCGCG GAGCCGGAGC CCGGGTCATG GTGGGGGACA ACCCCGGCCT GCACGGCTAC
GGCCGGTCGG AAAAGGCGGC CCGCATCGCC GGCATCCTGC AGGCGGCCGG TGACAGTTTT
ATCAACCTGG GTGGCCGGCC GGTGCGCCAC ACCGTTTCCT CCCGTGTTAT CGACCACGTG
ATGATCGCGT CCGAGGTGCT CTCCGCCGAC CTGGTGATCA ACCTGCCCAA GCTCAAGACC
CACGGCCTCA CCTATTTTAC CGGGGCCGTG AAAAACACTT TCGGCTACGT GGTGGGCGGG
GACAAGATGC GGGTCCATGC CGACGCCCCC ACGCCGGTAA AGTTTGCCGA AGCCCTGGTG
GATATTTTTT CGATCCGGCC TCCGGACCTG ACCATCATGG ACGCGGTCCT TGCCATGGAG
GGCAACGGCC CCAGCAGCGG CGCGCCCCGG TTCGTGGGCA AGGTCCTGGC CGGCACAAAC
GCCGTGAGCC TGGATGCCGC GGCCGTCACC CTGGTCGGAA AAAAGCCGGC CCGCATTCCC
CATCTTCGCA TTGCCGCGCA AAAGGGGCTG GGCCCCATCG ACATGGCCGA CATTGAGATC
AACACGCCCA TCGTGCCGGT GGCCGGGTTT AAAATGCCGG TCACCTTTCT GCCCGGCATC
ATGGGCGTGG TGCTCAACCG GATTCTGTCC CGGCGGATGA ACTGCACGCC GGAGGTGGTG
GAGGCGGTGT GCAAACAGTG CGGCATCTGT GTCAGCCACT GCCCGGTAAA CGCCATGACC
ATGGCGGAGG GCGAATTTCC CAGGGCCGAC GCTGCCGCCT GCATTCACTG CTACTGCTGC
CAGGAGATGT GTCCCGAACA TGCCATTGAA CTTACGGGCC GGGTCATGAG TTTTTTCCGG
CGCCGTAACA TTCATTAA
 
Protein sequence
MVGLPMDGTK GARVKTQDRV LIMDAAYEAE VMADVVDRVF DTFPLDLAGK NVLVKPNILS 
GYAPEKAVTT HPVLVSAVVE KLRGAGARVM VGDNPGLHGY GRSEKAARIA GILQAAGDSF
INLGGRPVRH TVSSRVIDHV MIASEVLSAD LVINLPKLKT HGLTYFTGAV KNTFGYVVGG
DKMRVHADAP TPVKFAEALV DIFSIRPPDL TIMDAVLAME GNGPSSGAPR FVGKVLAGTN
AVSLDAAAVT LVGKKPARIP HLRIAAQKGL GPIDMADIEI NTPIVPVAGF KMPVTFLPGI
MGVVLNRILS RRMNCTPEVV EAVCKQCGIC VSHCPVNAMT MAEGEFPRAD AAACIHCYCC
QEMCPEHAIE LTGRVMSFFR RRNIH
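
The coordinate and length fields in the Gene Information table are related by simple arithmetic, which can serve as a quick consistency check on the record. The sketch below is illustrative only: the function names `gene_length` and `protein_length` are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(659151, 660308)
print(length_bp, protein_length(length_bp))  # prints "1158 385"
```

Both results agree with the Gene Length and Protein Length fields reported above.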