Gene Dole_3059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_3059 
Symbol 
ID5695919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp3664891 
End bp3666654 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content62% 
IMG OID641265676 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_001530939 
Protein GI158523069 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000769965 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTACG ATGCAACAAA AATGGCCGAC TGGCAGATTT CCGAGGAAGC GGAAAAGAAC 
ATGCCCATGC CCGAAGAGTG GTGTGAGAAG CTGGGACTTG AAAAGGAAGA GATGCTGGCC
ATGGGCCGGC TGTCCAAGCT GGACTTTCTG AAGATCATCA AACGGCTGGA AGCCAAACCC
GACGGCAAGT ACATTGAGGT GACTGCCATC ACCCCGACCC CGCTGGGAGA GGGCAAAAGC
ACCACCTCCC TGGGCCTGAT GGAAGGCCTG GGCGCCCGGG GCAAAAGTGT GGGCGGTGCT
CTGCGCCAGC CCTCCGGCGG CCCCACCATG AACGTCAAGG GCACGGCGGC CGGCGGCGGC
AACTCCCTGC TGATTCCCAT GACCGAGTTC TCCCTGGGAC TGACCGGCGA CATCAACGAC
ATCATGAACG CCCACAACCT GGGCATGGTG GCCATGACCG CCCGCATGCA GCACGAGCGC
AATTACAACG ACGAGCAGCT TCAGCGCCTC ACCGGCATGC GCCGCCTGGA CATCGATCCC
ACCCGCGTTG AGATGGGCTG GATCATTGAC TTCTGTGCCC AGGCCCTGCG CAACATCGTC
ATCGGCCTCG GCGGCCGCAC CGACGGCTAC ACCATGCAGT CCAAGTTCGG CATTGCCGTG
GGCTCCGAGC TCATGGCCAT CCTGGCCGTG GCCACCGACC TGGCCGACCT GAAGGAGCGC
ATCAACAACA TCACCGTGGC CTTTGACAAG TCCGGCAAAC CGGTCACCTG CCGTGACCTG
GAAGTGGGCA ACGCCATGGC CGCCTTCATG CGCAACACCA TCAACCCCAC CCTCATGAGC
ACCGCCGAGT ACCAGCCCTG CCTGGTGCAT GCGGGTCCCT TTGCCAACAT CGCCGTGGGC
CAGAGCTCCA TCATTGCCGA CCGCGTGGGC CTCAAGCTGT GGGACTACCA TGTCACGGAG
TCCGGGTTTG CCGCTGACAT CGGTTTTGAA AAATTCTGGA ACGTCAAGTG CCGTTTCTCC
GGCCTCAAGC CCCATGTGTC GGTTCTGACC GCAACCATCC GCGCACTGAA GATGCACGGC
GGCGGCCCCA AGGTCGTGGC CGGCAAGGCC CTGGACGACG CCTACACCAA GGAGAATCTG
GCCCTGGTGG AAAAGGGTGT CGAGAACATG GTCCACATGA TCGGCGTGAT CCGTAAATCC
GGCATTAACC CGGTGGTCTG TGTCAACCGC TTCTACACCG ACACCGATGC TGAAGTCGCT
ATCGTTAAGA AAGCGGCCGA GGCGGCCGGC GCCCGCTGCG CCGAGTCCAA GCACTGGGAA
AAAGGCGGCG AAGGCGCTTT GGAATTTGCC GATGCCGTTA TTGATGCCTG TGAAGAAGGC
AATGACTTTG ACTTCCTGTA TCCGCTGGAG ATGAAACTGC GCGACCGTGT TGATAAGATC
GCCAGGGAAG TGTACGGCGC CGACGGCGTT GATTGGTCTC CGGAAGCCAC GGCCAAGGCC
GAAATGCTGG AGAACGATCC CAAGTACGCC GACTTTGCCA CCATGATGGT CAAGACCCAC
CTCTCCCTCA CCCACGACCC GGTCAAGAAG GGTGTGCCCA AGGGGTGGCG GCTGCCCATC
CGCGACGTGC TGATTTACTC GGGCGCCAAG TTCCTGTGCC CCTGCGCAGG CACCATCAGC
CTGATGCCGG GTACCGGTTC CAACCCGGCT TTCCGTCGCA TCGACGTTGA CCCGGCCACC
GGCAAGGTCT CCGGCCTGTT CTAG
 
Protein sequence
MAYDATKMAD WQISEEAEKN MPMPEEWCEK LGLEKEEMLA MGRLSKLDFL KIIKRLEAKP 
DGKYIEVTAI TPTPLGEGKS TTSLGLMEGL GARGKSVGGA LRQPSGGPTM NVKGTAAGGG
NSLLIPMTEF SLGLTGDIND IMNAHNLGMV AMTARMQHER NYNDEQLQRL TGMRRLDIDP
TRVEMGWIID FCAQALRNIV IGLGGRTDGY TMQSKFGIAV GSELMAILAV ATDLADLKER
INNITVAFDK SGKPVTCRDL EVGNAMAAFM RNTINPTLMS TAEYQPCLVH AGPFANIAVG
QSSIIADRVG LKLWDYHVTE SGFAADIGFE KFWNVKCRFS GLKPHVSVLT ATIRALKMHG
GGPKVVAGKA LDDAYTKENL ALVEKGVENM VHMIGVIRKS GINPVVCVNR FYTDTDAEVA
IVKKAAEAAG ARCAESKHWE KGGEGALEFA DAVIDACEEG NDFDFLYPLE MKLRDRVDKI
AREVYGADGV DWSPEATAKA EMLENDPKYA DFATMMVKTH LSLTHDPVKK GVPKGWRLPI
RDVLIYSGAK FLCPCAGTIS LMPGTGSNPA FRRIDVDPAT GKVSGLF