Gene Daro_3740 details


Gene Information

Locus tag: Daro_3740
Symbol:
ID: 3567375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4018672
End bp: 4019649
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 66%
IMG OID: 637682214
Product: thiamine-monophosphate kinase
Protein accession: YP_286939
Protein GI: 71909352
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase
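
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; neither convention is stated in the record, although the listed values are consistent with both. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(4018672, 4019649)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

With the values above, 4019649 - 4018672 + 1 gives 978 bp, and 978 / 3 - 1 gives 325 aa, matching the Gene Length and Protein Length fields.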


Plasmid Coverage information

Num covering plasmid clones: 76
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.129038
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGGCG AATTCGCGCT GATCGACAAG TACTTCGCCC GGCCGACCCC GTCGGCTATC 
CTCGGCCCCG GCGACGACTG CGCCTTGGTC CAGCCCTCAC CGGGCAAGCA ACTGGCCATC
ACCACGGACA TGCTGGTGGC CGGCACACAC TTCCTGCCCG GCACCGACCC GAAGAATCTC
GGCTGGAAAG CCCTCGCCGT CAATCTCTCC GACCTCGCTG CGATGGGCGC CCAACCGCGC
TGGGTCACGC TGGCCGGCGC CTTGCCGAGC GTTGACGAGG CCTGGATCGC CGCCTTCGCC
AGCGGCTTCT TCAACTGTGC CCAGGAATAC GGCGTCGACG TCATCGGCGG CGACACCACC
AAGGGCCCGC TCAACGTCTG CATCACCGCC ATCGGCGAAG TCGAACCCGG CCAAGCCCTG
CGCCGCGATG GCGCCAAGGT CGGTGACCAG ATCTGGGTAT CCGGCCGTCC CGGCCTCGCC
GCCCTCGGCC TCGCCTATCT GCAAGGCAAG GTCAAGCTGC CAGAACCGTG GCCACGGCTA
TGCGTCGGCG CCCTCGAAAA GCCGCGCCCG CGCGTTGCCC TCGGCCTCGC ACTGACCGGC
ATCGCCAGTG CCGCGATCGA TGTTTCCGAC GGTCTGCTGG CCGACCTCGG CCACATTGCC
GAACGCTCTG CCTGTGCAGC CGCCGTCAAA CTCGTTCAGC TACCGCACCT GCCCAAGGGC
GAAAGCTACG ATGCCGACCT TCGACGCATT GCCCTCGAAT GCCAGCTGGC CGGTGGCGAC
GATTACGAAC TCTGCTTCAC TGCCCCCGGT AGCCAAAGTC TGGCCATTGC GCAAATTGCC
GCCCAACTCG AATTGCCGCT GTGGAACATT GGCGAAATGG TGACCGGCCA GGCTGGCGAA
GTCGCTGTAT TCGACCCGGA CGGCAAGCCG GTCGAGTTCA ATCACAAGGG ATACGAGCAC
TTTGGCGCCG AAACCTGA
 
Protein sequence
MAGEFALIDK YFARPTPSAI LGPGDDCALV QPSPGKQLAI TTDMLVAGTH FLPGTDPKNL 
GWKALAVNLS DLAAMGAQPR WVTLAGALPS VDEAWIAAFA SGFFNCAQEY GVDVIGGDTT
KGPLNVCITA IGEVEPGQAL RRDGAKVGDQ IWVSGRPGLA ALGLAYLQGK VKLPEPWPRL
CVGALEKPRP RVALGLALTG IASAAIDVSD GLLADLGHIA ERSACAAAVK LVQLPHLPKG
ESYDADLRRI ALECQLAGGD DYELCFTAPG SQSLAIAQIA AQLELPLWNI GEMVTGQAGE
VAVFDPDGKP VEFNHKGYEH FGAET
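
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline. It assumes the sequence is pasted in as shown, in space-separated blocks of ten bases, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs mainly in the start codons it permits). The names `translate` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGGCCGGCG AATTCGCGCT GATCGACAAG"))  # MAGEFALIDK
```

The first thirty bases translate to MAGEFALIDK, the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared against the GC content field.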