Gene Daro_3591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3591 
Symbol 
ID3568255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3846820 
End bp3848826 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content65% 
IMG OID637682064 
Producttransketolase 
Protein accessionYP_286790 
Protein GI71909203 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0021] Transketolase 
TIGRFAM ID[TIGR00232] transketolase, bacterial and yeast 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0287801 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCA GCAACCTCCC CAAGTTTTCT CCCCTGACCG GCGCCATCCG CGCCCTCGCC 
ATGGATGCAG TCCAGCAGGC CAACTCCGGG CATCCCGGCG CCCCAATGGG CATGGCTGAA
ATCGCCGAAG TCCTCTGGCG CCGCCACCTG CGTCACAACC CGGCCAACCC GCACTGGGCC
GACCGCGACC GCTTTGTGCT GTCGAACGGC CACGGCTCGA TGCTGCTCTA CGCGCTGCTG
CACCTGACCG GCTACGATCT GTCGATCGAT GACCTGAAGA ATTTCCGCCA GTTGCACGCC
AAGACCCCGG GCCATCCGGA ATACGGCTAC ACGCCGGGCA TTGAAACGAC CACCGGCCCG
CTCGGCCAGG GCATCACCAA CGCCGTCGGC TTCGCGCTGG CTGAAAAGGT GCTGGCCGCC
GAGTTCAACA AACCCGGCCA CGAAATCGTC AATCACCACA CCTATGTCTT CCTGGGCGAC
GGCTGCCTGA TGGAAGGCGT GTCGCATGAA GCCTGCTCGC TGGCCGGCAC GCTCGGCCTC
GGCAAGCTGA TCGCCTTCTG GGACGATAAC GGCATCTCGA TCGACGGTCA CGTCGAAGGC
TGGTTCACCG ACGACACCCC GAAGCGCTTT GAAGCCTACG GCTGGCATGT CGTCGCCCAC
GTCGACGGCC ATGATTCCGA CGCCATAGAA CGCGCCCTGC TCGCCGCCAA GGCAGTCACC
GACAAGCCCA GCCTGATCTG CTGCAAGACG ACCATCGGCG CCGGCTCGCC GAACAAGCAG
GGATCGCACG ACTGCCACGG CGCCCCGCTC GGCAAGGACG AAATCGCTGC CGCCCGCGCT
TACATCGGCT GGAACCACCC GGCCTTCGAA ATCCCGGCCG ACATTTACGC CGCCTGGAAT
CGCAAGCCGG CCGGTGCTGT TTTTGAGGAA AACTGGAGCA CCCGTTTCGC TGCCTACCGC
ACCGCCTTCC CGGCCGAAGC CGCCGAATTC GAGCGCCGCG TCATCAAGAA CGAACTGCCA
ACCAACTGGG CAGCGACCAA GGCCGCCTAC ATCGCCACCT GCCGCGACAA GGCCGAGAAC
ATCGCCACCC GAAAGGCTTC GCAGAACGCC ATTGCCGCAC TGGTCCCGGC CGTGCCGGAA
ATCTTCGGCG GCTCAGCCGA CCTGGCCGGC TCCAACCTGA CCTTCGTCAA GGGCAGCAAG
GGCGTCACCC GCACCGAGGG CGGCAACTAT TGCTACTACG GTGTGCGCGA ATTCGGCATG
ACCGCCATCG CCAACGGCAT CGCGCTGCAT GGTGGCCTGG TGCCCTACAC CGCGACTTTC
CTGGTCTTCT CCGACTACGC CCGCAACGCC ATCCGTATGG CGGCGTTGAT GAAGCAGCGC
CAGATCATGG TCTATACCCA TGACTCCATC GGTCTCGGCG AAGATGGCCC GACGCACCAG
CCGGTCGAGC ATATCCCGTC GATGCGCATC ATCCCGAACC TCGACGTCTG GCGCCCGGCC
GACGCGACCG AAACGGCCAT TGCCTGGACC GCAGCGGTCG AGCGCAAGGA TGGCCCGAGC
ATCCTCGCCC TGTCGCGCCA GAACCTGCCG ACCGTCACCC AGCAGGCGGC CGATGCCGAC
ATCGCCAAGG GCGGCTATGT ACTGGCCGAA GCGGATGGCG AAGCGCAGAT CACCTTCATT
GCCACCGGCT CCGAAATCAA GCTGGCGCTC GACGCCCAGG CTGCACTGGC CGGCGAAGGG
ATCAAGACCC GCGTCGTCTC GATGCCCTGC TCCAATGTTT TCGACCGCCA GAGCGCCGAA
TACAAAGCCT CGGTGCTCGG CGCCTGCAAA AAACGCATCG CCATCGAAGC CGCTCACCCG
GACTTCTGGC GCAAGTACGT CGGCCTGCAT GGCGCCGTGA TCGGTATCGA CCGCTTCGGC
GAGTCCGCAC CGGCCGGCCA GCTGTTCGAC CTGTTCGGTT TCACCGTCGC CAACGTCGTC
AAGACGGCCA AGGCACTGTT GTCCTGA
 
Protein sequence
MSVSNLPKFS PLTGAIRALA MDAVQQANSG HPGAPMGMAE IAEVLWRRHL RHNPANPHWA 
DRDRFVLSNG HGSMLLYALL HLTGYDLSID DLKNFRQLHA KTPGHPEYGY TPGIETTTGP
LGQGITNAVG FALAEKVLAA EFNKPGHEIV NHHTYVFLGD GCLMEGVSHE ACSLAGTLGL
GKLIAFWDDN GISIDGHVEG WFTDDTPKRF EAYGWHVVAH VDGHDSDAIE RALLAAKAVT
DKPSLICCKT TIGAGSPNKQ GSHDCHGAPL GKDEIAAARA YIGWNHPAFE IPADIYAAWN
RKPAGAVFEE NWSTRFAAYR TAFPAEAAEF ERRVIKNELP TNWAATKAAY IATCRDKAEN
IATRKASQNA IAALVPAVPE IFGGSADLAG SNLTFVKGSK GVTRTEGGNY CYYGVREFGM
TAIANGIALH GGLVPYTATF LVFSDYARNA IRMAALMKQR QIMVYTHDSI GLGEDGPTHQ
PVEHIPSMRI IPNLDVWRPA DATETAIAWT AAVERKDGPS ILALSRQNLP TVTQQAADAD
IAKGGYVLAE ADGEAQITFI ATGSEIKLAL DAQAALAGEG IKTRVVSMPC SNVFDRQSAE
YKASVLGACK KRIAIEAAHP DFWRKYVGLH GAVIGIDRFG ESAPAGQLFD LFGFTVANVV
KTAKALLS