Gene Dtox_0231 details


Gene Information

Locus tag: Dtox_0231
Symbol:
ID: 8427155
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfotomaculum acetoxidans DSM 771
Kingdom: Bacteria
Replicon accession: NC_013216
Strand:
Start bp: 250102
End bp: 251805
Gene Length: 1704 bp
Protein Length: 567 aa
Translation table: 11
GC content: 47%
IMG OID: 645032618
Product: Formate--tetrahydrofolate ligase
Protein accession: YP_003189807
Protein GI: 258513585
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
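
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 250102
end_bp = 251805
gene_length_bp = 1704
protein_length_aa = 567

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # span matches gene length
print(span_bp % 3 == 0)                              # whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop
```

Under these assumptions all three checks print `True`: 251805 - 250102 + 1 = 1704 bp, which is 568 codons, or 567 residues plus one stop codon.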


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTTG TACCTAGCGA TCTTGAAATT GTACAAGCGC ACAAGATGAC CCCGATTGGG 
GAAATTGCTG CAAAAATGGG TCTTACCGAG GACGATTTTG ATTATTATGG CAAGTATAAA
GCCAAAATTA GTCTGGACGT AATTGAAAAG TTTAAGGATC GTCCTAACGC AAAACTAATC
GATGTTACTG CTATTACTCC CACTCCGCTG GGCGAAGGTA AAACCGTTAC CACCATCAGC
TTGACTCAAG GTTTAGGACA TATTGGCAAA AAGGTTATCT GCACACTGCG TCAACCTTCT
CTGGGCCCTG TTTTTGGTAT TAAGGGCGGT GCCGCTGGCG GCGGTTATGC TCAGGTAGTT
CCTATGGAAG ATCTGAACAT CCATTTCACC GGTGACATTC ATGCTATTGA AACTGCTAAC
AACCTGCTGG CAGCTATGAT TGATACTTCA ATCCTGCTTG ACAACCCCCT GAACATCGAT
CCGATGTCTA TCATGTGGAG ACGCGTTTTT GACTTAAACG ACCGTGCGCT GCGTGACATC
GTTATTGGTT TAGGCGGTAA AGAAAACGGT TATCCGCGTC AGACCGGCTT TGACATTGCT
GTTGCTTCCG AAGTTATGGC TATTCTCGCA TTAACCACCA GCTTGCAGGA TATGCGTGAG
CGTTTTGCTC GCATCATCTT TGGTTTTACC TATGACGGAA AGCCTGTTAC TGCTGAGCAA
ATCAAGGCTG CAGGCTCTAT GACAGTTATT ATGAAGGAAG CTATCAAGCC GAACCTCGTA
CAGACCTTAG AAGGCCAGCC TTGCATTATG CACGCAGGCC CGTTTGCTAA CATTGCTCAT
GGTCAAAACT CCGTTCTGGC TGACATGATT GCCCTCAAGT GTGCAGACTA TGTTGTAACC
GAGTCCGGTT TCGGTGCTGA CATGGGTATG CAGAAGTTCA TGGATATCAA ATGTCGTCAA
TCCGGCTTGC GCCCGAACTG CGTAGTAGTA ACCTCTACTG TTCGTTCACT GAAAATGCAT
GGTGGCGTAG GTAATATCGT TGCAGGTAAA CCGCTGCCGA AAGAATTGAC AGAAGAGAAC
CTGCCGGCTC TGGAAAAAGG CGCGGCCAAC ATGATGCATA TGATTAAGAT AGCTAAGGGT
TATGGAATAC CGGTTGTTGT ATCCATTAAC CGTTTCATCA CTGACACCGA CAATGAAATT
AACTTGCTTA AAGATAAGGC TAAAGAAGCA GGCGCCTTTG GCGTTGGCGT CAACACTGCC
TGGGGTGACG GCGGCGTTGG ATGTGCTGAA GTGGCTGAAT TGGTTGTTAA GGCTTGCGAA
GAACCGACTG ACTTCCAGTT CCTCTATCCC GACAGCTTTA CCATTAAGGA AAAAATCGAA
ACCATGGCTA AGAAAATCTA CAACGCTGAC GGCGTATCCT ATGACCCGCT GGCAGAAAAG
AAAATTGCTC AGTTTGAAGA ACTCGGCTTA GGCAATTTAC CGATCAACAT GGCTAAGACT
CACCTGTCCA TCTCACATGA TCCGGGAATG AAGAATGTTC CGAGCGGCTA CATTTTCCCG
ATCCGTGATA TTCGCGCATC CGTAGGTGCG GGCTTCCTGT ATCCTCTGGC AGGCGCAATG
AGAACTATGC CTGGTTTGGG ACGTGCGCCT TCCGCTATTA AGGTTGATAT TGATAAAGAC
GGCAAGATAG TTGGTTTGTT CTAG
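
The gene sequence is listed in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following is a minimal sketch of that cleanup together with a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as input here. Pasting the full listing would allow a comparison with the GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used as a short example input.
first_line = "ATGAAAGTTG TACCTAGCGA TCTTGAAATT GTACAAGCGC ACAAGATGAC CCCGATTGGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```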
 
Protein sequence
MKVVPSDLEI VQAHKMTPIG EIAAKMGLTE DDFDYYGKYK AKISLDVIEK FKDRPNAKLI 
DVTAITPTPL GEGKTVTTIS LTQGLGHIGK KVICTLRQPS LGPVFGIKGG AAGGGYAQVV
PMEDLNIHFT GDIHAIETAN NLLAAMIDTS ILLDNPLNID PMSIMWRRVF DLNDRALRDI
VIGLGGKENG YPRQTGFDIA VASEVMAILA LTTSLQDMRE RFARIIFGFT YDGKPVTAEQ
IKAAGSMTVI MKEAIKPNLV QTLEGQPCIM HAGPFANIAH GQNSVLADMI ALKCADYVVT
ESGFGADMGM QKFMDIKCRQ SGLRPNCVVV TSTVRSLKMH GGVGNIVAGK PLPKELTEEN
LPALEKGAAN MMHMIKIAKG YGIPVVVSIN RFITDTDNEI NLLKDKAKEA GAFGVGVNTA
WGDGGVGCAE VAELVVKACE EPTDFQFLYP DSFTIKEKIE TMAKKIYNAD GVSYDPLAEK
KIAQFEELGL GNLPINMAKT HLSISHDPGM KNVPSGYIFP IRDIRASVGA GFLYPLAGAM
RTMPGLGRAP SAIKVDIDKD GKIVGLF
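
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do this in plain Python. It assumes that the amino acid assignments of translation table 11 are the same as those of the standard genetic code, and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence listing; both start in frame.
first_line = "ATGAAAGTTG TACCTAGCGA TCTTGAAATT GTACAAGCGC ACAAGATGAC CCCGATTGGG"
last_line = "GGCAAGATAG TTGGTTTGTT CTAG"
print(translate(first_line))
print(translate(last_line))
```

The two printed strings correspond to the first 20 residues (MKVVPSDLEIVQAHKMTPIG) and the last 7 residues (GKIVGLF) of the protein sequence listed above, with the final TAG codon read as a stop.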