Gene Dole_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_2024 
Symbol 
ID5694864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2452252 
End bp2453580 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content62% 
IMG OID641264622 
Productphosphoglucosamine mutase 
Protein accessionYP_001529905 
Protein GI158522035 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1109] Phosphomannomutase 
TIGRFAM ID[TIGR01455] phosphoglucosamine mutase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000439649 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGGCGC GGCTTTTTGG CACCGACGGC ATTCGAGGCG CTGCCAACTC CTGGCCCATG 
ACACCGGAAA CAGCCATGGC CGTGGGCAGG GCCGTGGCCC GGTTCATGAC GGCAGACGGT
CAATCCCCCC CCCGGATTCT GGTGGGTAAA GACACCCGGC TTTCCGGCGA CATGCTGGAA
AGCGCCCTGT GCGCTGGTAT CTGCGCTTCA GGCGTGGACG CAATCCGCGT GGATGTGCTT
CCCACCCCGG CGGTGGCCTA CCTTACCGCC ATGCTGAAAG CCGGCGCCGG CATCATGGTG
TCGGCCTCTC ACAACCCCTG GACCGACAAC GGCATCAAGA TTTTTTCCCA CAAAGGGCAT
AAGCTTTCCC CGGTTCAGGA GGCCGAGCTG GAGGCGTTGA TTCTCTCCCC GGAGCCGATG
GCGGCCGCCA ATCCACCGGT GCCCGGCCGG GTCTTTCATC TCATGGATGC CGAAGAACCT
TATGTCGAAT GCCTGAGCAA CATCACCGCG GTCGGCTCCC TCTCCCTGGT ATTAGACTGC
GCCAATGGCG CCGCTGCTCG TGTGGCCCCC CGTCTTTTTC CCGATGCCCG CCTGTTGTCT
GCTGATCCCG ACGGGCGGAA CATTAACGAA AACTGCGGTT CCGAGCACAC AGAAGCGCTT
CGGGCCGAGG TGGTGAAATA CCGTGCCGAT GCCGGATTTG CCTTTGACGG TGACGCCGAC
CGGCTGATCG CCGTGGATGA AACCGGGGCG CCGGTCACCG GGGACCGGAT TATCGCCATC
TGCGCCGGTT TCATGAAATC CGAGAACCTG CTGAAAAACA ATACCGTGGT CAGCACCGTC
ATGAGCAACA TCGGCCTGAA CCGCGCGCTT CGGGATATGG GGATTTATCA CGTGGTCACC
GATGTGGGGG ACCGCCATGT GACGGCGGCC ATGCTGGAAA AGGGCGCCTC CCTGGGTGGC
GAGGACTCGG GCCACATCGT TTTTTCTGAT TACCAGACAA CAGGTGACGG CCTGCTCACG
GCCCTGATGC TCTGCCGGAT CATGAACCAT ACCGGCAAGC CCCTGTCGGA GCTGGCCGCG
TGCATGGATG TTTTTCCCCA GGTGCTGATC AACGTGAAAG TGGCCCGTAA ACCGGACCTC
GCCTCGGTGC CTGAGGTATG GCAGGTCGTC AGGGATGTTG AGGCCCGTCT TGGCCGGGAG
GGGCGGGTAC TGGTCCGTTA TTCCGGCACC CAGCCCATGT GCCGGGTCAT GGTGGAAGGC
CCTTCGGAAG ACGAAACCCG GCAATGCGCC GGGCAGATTG CCAAAGCAGT TGTGCAGGCC
CTGGGATAA
 
Protein sequence
MTARLFGTDG IRGAANSWPM TPETAMAVGR AVARFMTADG QSPPRILVGK DTRLSGDMLE 
SALCAGICAS GVDAIRVDVL PTPAVAYLTA MLKAGAGIMV SASHNPWTDN GIKIFSHKGH
KLSPVQEAEL EALILSPEPM AAANPPVPGR VFHLMDAEEP YVECLSNITA VGSLSLVLDC
ANGAAARVAP RLFPDARLLS ADPDGRNINE NCGSEHTEAL RAEVVKYRAD AGFAFDGDAD
RLIAVDETGA PVTGDRIIAI CAGFMKSENL LKNNTVVSTV MSNIGLNRAL RDMGIYHVVT
DVGDRHVTAA MLEKGASLGG EDSGHIVFSD YQTTGDGLLT ALMLCRIMNH TGKPLSELAA
CMDVFPQVLI NVKVARKPDL ASVPEVWQVV RDVEARLGRE GRVLVRYSGT QPMCRVMVEG
PSEDETRQCA GQIAKAVVQA LG