Gene Dole_0293 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_0293 
Symbol 
ID5693112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp336565 
End bp337605 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content57% 
IMG OID641262874 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_001528180 
Protein GI158520310 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCTC TGACCTACGC GGATGCCGGC GTGGATATTG ATAAAGCCAA CGCGCTTGTG 
GACAACATCA AGAAAATCGC CAAGCAGACC CGGCGGCAGG GGGTGATGGG CGACATCGGC
GGGTTCGGCG GGCTTTTCTC CCTGAACCTG TCGGACCTGA AAAATCCTGT GCTGGTCAGC
TCCACCGACG GTGTGGGCAC CAAGCTGAAG ATCGCCTTTA TGGCGGGCCG GCACGATACC
GTGGGCATCG ACCTTGTGGC CATGTGCGTC AACGATATTG CCGTCCAGGG CGCAAAACCG
CTCTTTTTTC TGGACTACAT GGCCGTGGGA AAGCTGAATA CAGAGATCGC CACCCAGGTG
ATCACCGGCA TCGGAGAGGG ATGCAAACAG GCCAAATGCG CCCTGATCGG CGGTGAAACC
GCTGAAATGC CGGGTTTTTA CAAAGACAAC GAGTATGACC TGGCCGGTTT TACCGTGGGC
ATCGTGGAGA GCGACGCCAT TATTGACGGG TCCAACATTC ACGTGGGCGA CGCCATTATC
GGCATCGCTT CCAGCGGGCT GCACAGCAAC GGTTTTTCTC TGGTCCGCAA GATATGCTTT
GACGTGCTGA AGCTCAAGAT TGACGATCAT ATCGACGATC TGGGCAAAAC CCTGGCCGAG
GAGCTGTTGA CCCCCACTAT CATTTATTCG GAGACGGTTC ACAGCCTGCT CAAGCTCTTT
CCGATTCACG GCATCGCCCA TATCACCGGC GGCGGTCTGG CCGAAAACGT GGTCCGGGTG
CTGCCCCAGG CCTGCGTGGC CACCATTCGA AAAGGATCAT GGGACGTGCC TCCGGTCTTT
TCTTTTTTGC AAAAGGCAGG AAAGGTCGAA GACCGCGAGA TGACCCGCAC CTTTAACAAC
GGCATCGGCC TGGTGGTGGT GGTGCCCGCG AAAAAAGCCG ACGATGCCAT GGCAAGCATT
CGGGCCGTGG GGGAAAAGCC GTTTCTGATC GGTGAGATCA CCCCCAGAAA AGCGGATGAA
CCCCAGGTGC AACTGGTGTA A
 
Protein sequence
MSSLTYADAG VDIDKANALV DNIKKIAKQT RRQGVMGDIG GFGGLFSLNL SDLKNPVLVS 
STDGVGTKLK IAFMAGRHDT VGIDLVAMCV NDIAVQGAKP LFFLDYMAVG KLNTEIATQV
ITGIGEGCKQ AKCALIGGET AEMPGFYKDN EYDLAGFTVG IVESDAIIDG SNIHVGDAII
GIASSGLHSN GFSLVRKICF DVLKLKIDDH IDDLGKTLAE ELLTPTIIYS ETVHSLLKLF
PIHGIAHITG GGLAENVVRV LPQACVATIR KGSWDVPPVF SFLQKAGKVE DREMTRTFNN
GIGLVVVVPA KKADDAMASI RAVGEKPFLI GEITPRKADE PQVQLV