Gene Dole_2204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2204 
Symbol 
ID5695050 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2676843 
End bp2678243 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content55% 
IMG OID641264808 
Producthypothetical protein 
Protein accessionYP_001530085 
Protein GI158522215 
COG category[S] Function unknown 
COG ID[COG3034] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCGGA TAAACATGGT GTGGTATCGG GATTTATTGA AAGGCATCGG GCGATTTTTT 
TGTGTGAACA TGCCGGCGGC AGCGATGGCG CTGGCGGTAC TGGGCATCTC CGGGACAGGG
GCCCAGGCAT TTTCCATTCC CGACCTGGTG GTCTATTCCG AGGAGGACCG GCCCCAGTAT
GTCATCCTGG TGGAAAAGGC CACCCAGCAG CTTTTCCTGT TTTCTTTTTA CAAAAATTCT
ATTCGCAAGG AAAAGCAGTG GAAGTGCTCC ACCGGTGAGA ACCATGGTCC CAAGCTGGTG
ATGGGTGACA AAAAAACACC GGAGGGCATC TATTTTTTCA CCCGCAAGTA CACAGACAGT
GAGCTGGCGC CGATTTATGG CACCCTTGCG TTCCCTCTGG ACTACCCCAA CGTGCTGGAT
CATGAAGCGG AAAAGAACGG CAGCGCCATC TGGCTTCACG GTACCAATAA AGTGTTGAAG
GACAATGATT CCAACGGCTG CGTGGCGCTG GAGAATGGCA GCATTGACGC GCTGGCCCCG
TATATAGAGC TTTATCATAC CCCCATCGTG ATCGTCCACG AGGTGGCCGA AGCCCCATGG
CAGGCCGTTC CCCGGGTGGA CAAGGCGGTG GGAGCCCTGG TGCGGCAATG GAATGACGCA
CTGGTTTCCG GCACCTACCA CGACTATCTT CGGTTTTATG ACCCCCGTTA CCTGCCCGAC
ATGGGGTGGT GGAAAAAGTG GCGCCGAGTG CGCGGCGAGG TGGCCGGGGA ACTTCCGGAC
CTTTACATTG ACATGGATTC GCTTCTGGTG GTTCGCTATG ACAAAAATTA TGTGGCCCTT
TTTGACCAGG TTGTTTCAGC CGGCGGCAGC TGGGTGAAAG CAGGCAGAAG AAAACTGTTC
CTGGAAGAGA CGACCAACGG ACTGAAGATT GTCGGTGACA CTTTTCAGGG ACCTGAGGTG
AAGGGGGGCG CCCAGAGTCC TTCCAACCGT CTGGTGGCGG CCTGCCGGAA CGTGTATGAC
GTGGCGGTGA CGGAGAAAAG TGTCCGCCGG ATGCTGGACC AGTGGCTGTC GGCATGGAGC
CGAATGGACA TGGACGCATA CGGCACCTTT TATGCCGGCG ATTTTGTTTT CGATGGCATG
AACAAAAAGG GGTGGCTGGC CTATAAGCGG AAATTGAACT GGCAGTATGA CTATATTCGC
GTGGCCATGG ACAACCTGCG GTTTGTGGAG ACATCACCCG CCGTTTGTGT GGTGACCTTC
ATTCAGAAAT ACGAGTCAGA CCGTTACAGC GATATGGGCC TGAAAACCCT GATCCTGAAA
AAGGAGGACG GGCAATGGAG GATTCACCGG GAGTCCTGGG AAGCGTTGGA CCTTCCGCCG
ACACCGAAAA CAGAGGGCTG A
 
Protein sequence
MARINMVWYR DLLKGIGRFF CVNMPAAAMA LAVLGISGTG AQAFSIPDLV VYSEEDRPQY 
VILVEKATQQ LFLFSFYKNS IRKEKQWKCS TGENHGPKLV MGDKKTPEGI YFFTRKYTDS
ELAPIYGTLA FPLDYPNVLD HEAEKNGSAI WLHGTNKVLK DNDSNGCVAL ENGSIDALAP
YIELYHTPIV IVHEVAEAPW QAVPRVDKAV GALVRQWNDA LVSGTYHDYL RFYDPRYLPD
MGWWKKWRRV RGEVAGELPD LYIDMDSLLV VRYDKNYVAL FDQVVSAGGS WVKAGRRKLF
LEETTNGLKI VGDTFQGPEV KGGAQSPSNR LVAACRNVYD VAVTEKSVRR MLDQWLSAWS
RMDMDAYGTF YAGDFVFDGM NKKGWLAYKR KLNWQYDYIR VAMDNLRFVE TSPAVCVVTF
IQKYESDRYS DMGLKTLILK KEDGQWRIHR ESWEALDLPP TPKTEG