Gene Dole_3209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_3209 
Symbol 
ID5696071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3845848 
End bp3847218 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content64% 
IMG OID641265828 
Producthypothetical protein 
Protein accessionYP_001531089 
Protein GI158523219 
COG category[S] Function unknown 
COG ID[COG1322] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000788974 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATATTC CGGAAATCAT CTACAGTGCG GCCGCCGGCG CCCTTGCCGG CATTTTTTTC 
ACCTGGCTGG TCATGCGGGC CCGCCGGGCT GTGCTGGCCG ACCGGCTGGC GAGTGCCACC
ACTGAAATTG AAACCACCAG AACCGAGCGG GATGCTTTGC GCGGCGAGCT GTCCGACCAC
TCGGCCCGGT TCGCCCGGCT TGAATCGGCC CTGGAGCAGG AGCGGGAAAA ATCAAAAGAG
ATGGCCGCCT TTGCCCAGGC CGCCACGCAA ACCCTGAAGG ACACCTTCAA GGGGCTTTCC
GCCGACACCC TGGCCCAGAG CAGCGAGCAG TTTCTGCACC TGGCCAAAAG CGCGTTTGAG
TCCTTTCACG TCAAGGCGTC CGGTGATCTG GCCCAGCGGC AGAAGGCCGT GGAAGAGATT
GTCCGGCCGG TAAAAGAGGC CCTGGACAAG GTCAACACCC AGGTGGCCGA GGTGGAAAAG
AGCCGCAAGC AGGCCTACGG GTCGCTGACC GCCACGGTGG AGTCCCTGCT GCGCGGCCAG
AAGGAGCTTT CCACGGAAAC CGGCAACCTG GTCTCGGCCC TGCGCAAGCC CATGGTGCGG
GGCCGGTGGG GCGAGATCCA GCTGCGCCGG GTGGTGGAGT TTGCCGGCAT GCTGCCCCAC
TGCGACTTTG TGGAGCAGAG TTCGGTGAAG ACCGAAACCG GCACCCTGCG GCCCGACATG
CTGGTCCGCC TGCCCGGCGG CAAGCTGGTG GTGGTGGATT CAAAGGCCCC GCTGGAGGCC
TATCTTTCGG CGGTCAGCGC CGAAGACGAG GCCACGCGCA AAAAATTCAT GGCCGACCAC
ACCCGGCACC TGCGCACCCA TATTCAGCAA CTGTCGGACA AGGCCTACTG GGAGCAGTTT
GACCAGGCCC CGGATTTCGT GGTGCTGTTC CTGCCGGGCG AACCCTTTTT CAGCGCGGCC
CTGGAGCAGG ACGAAGGGCT CATTGAGTTC GCCGTGGCCC GCCGCATCAT TCTGGCCTCC
CCCACCACCC TGATCACCCT GCTTCAGGCG GTCTCCTACG GCTGGCAGCA GGAGCAGATC
GCGGAAAACG CCCGCCACAT TCAGGAGCTC GGGGCCGATC TGTACCGGCG GATTTCAAAG
ATGGCCGACC ATTTCGGCAC GGTGGGAAAA TCCCTGGACA GGGCCGTCAA AAGTTACAAC
GATGCCGTGG GCTCCCTGGA GGCCCGGGTC CTGCCCGCGG CCCGCCGCTT TTCCGAGCTG
GACACCAGCA TCAAAAACGA GATTCCCAAG ATCGAGCCGG TGAATGTGGT GTCAAGAGAT
ATCTCCGCTC CGGAGCTGAT TGAACCGCCG GAGGAGGAAG AGGAGACCTG A
 
Protein sequence
MHIPEIIYSA AAGALAGIFF TWLVMRARRA VLADRLASAT TEIETTRTER DALRGELSDH 
SARFARLESA LEQEREKSKE MAAFAQAATQ TLKDTFKGLS ADTLAQSSEQ FLHLAKSAFE
SFHVKASGDL AQRQKAVEEI VRPVKEALDK VNTQVAEVEK SRKQAYGSLT ATVESLLRGQ
KELSTETGNL VSALRKPMVR GRWGEIQLRR VVEFAGMLPH CDFVEQSSVK TETGTLRPDM
LVRLPGGKLV VVDSKAPLEA YLSAVSAEDE ATRKKFMADH TRHLRTHIQQ LSDKAYWEQF
DQAPDFVVLF LPGEPFFSAA LEQDEGLIEF AVARRIILAS PTTLITLLQA VSYGWQQEQI
AENARHIQEL GADLYRRISK MADHFGTVGK SLDRAVKSYN DAVGSLEARV LPAARRFSEL
DTSIKNEIPK IEPVNVVSRD ISAPELIEPP EEEEET