Gene Ksed_19710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagKsed_19710 
Symbol 
ID8373476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameKytococcus sedentarius DSM 20547 
KingdomBacteria 
Replicon accessionNC_013169 
Strand
Start bp2055895 
End bp2057559 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content71% 
IMG OID644992225 
Producthydroxymethylpyrimidine synthase 
Protein accessionYP_003149735 
Protein GI256825775 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.19694 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.00821817 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCGTGA CCGACGGGGA GGTGACCGTG CCGATGACGG CGATCAGCCA GCACGACTCC 
CCGGACGGCA GCCCCAACGA GGACTTCGTC GTGTACCGCA CGATGGGGCC GGGGTCCGAC
CCGCAGGTGG GCCTGGAACC GGTGCGGCAG GCGTGGATCG AGGCGCGCGG CGACACCGCG
ACCCATGCCG GGCGGGAGCG TGACCTCGCC GACGACGGGC GCTCGGCCCT GCGGCGCGGG
GAGCCCTCGC AGGCGTGGCG GGGTCGGGTG CAGGAGCCGC GGCGCGCGCA GCCGGGCCGG
ACGGTGACGC AGCTGCACTA CGCCCGCCGG GGCGAGATCA CCCCCGAGAT GCGGTACGTG
GCCCTGCGCG AGGGCTGCGA GGTGGAGCTG GTGCGCTCGG AGGTGGCGGC GGGGCGCGCC
ATCATTCCGG CGAACGTGAA CCACCCCGAG TCCGAGCCGA TGGTCATCGG CCGGGCCTTC
CTCACCAAGG TGAACGCCAA CATCGGCAAC TCGGCCGTCA CCTCCTCGAT CGCCGAGGAG
GTGGAGAAGA TGTCCTGGGC CACCCGCTGG GGCGCCGACA CGGTGATGGA TCTCTCCACC
GGGGAGGACA TCCACACCAC GCGGGAGTGG ATCCTGCGCA ACTCCCCGGT GCCGATCGGC
ACGGTCCCCA TCTACCAGGC CCTGGAGAAG GTCGACGGCG TGGCCGAGGA GCTCACATGG
GAGGTCTTCC GCGACACGGT TGTCGAGCAG GCAGAGCAGG GCGTGGACTA CATGACCGTG
CACGCCGGGG TGCGGCGGCG GCACGTGCCT CTGACGGCCG GGCGTGTGAC CGGCATCGTC
TCCCGCGGCG GGTCGATCAT GGCCGGGTGG TGCATGGCCC ACCGCCGCGA GTCGTTCCTG
TACGAGCACT TCGACGAGCT GTGTGAGATC TTCGCCGCGC ACGACGTCGC CTTCTCGCTG
GGCGACGGGC TGCGGCCGGG GTCGCTGGCC GACGCGAACG ACGCCGCCCA GCTGGCCGAG
CTGCGCACCC TGGGCGAGCT GACGCAGCGG GCCTGGGAGC ACGACGTGCA GGTGATGGTG
GAGGGCCCCG GCCACGTGCC GCTGCACCTG GTCAAGGAGA ACGTGGACCT GGAGCAGGAG
TGGTGCCACG GGGCGCCCTT CTACACGCTG GGCCCGCTGG CCACCGACAT CGCGCCGGGC
TACGACCACA TCACCAGTGC CATCGGGGCG GCCCCCATCG CCCAACACGG CACGGCGATG
CTCTGCTACG TCACGCCGAA GGAGCACCTG GGCCTGCCGA ACCGGGACGA CGTGAAGACC
GGGGTGATCA CCTACAAGAT CGCCGCGCAC GCCGCGGACG TCGCCAAGGG CCACCCGCGG
GCCCGGGACT GGGACGACGC CATGAGTAAG GCGCGCTTCG AGTTCCGTTG GCACGACCAG
TTCGCGCTGG CGCTGGACCC GGTGACCGCG CAGGAGTTCC ACGACGAGAC GCTGCCGGCC
GAGCCGGCCA AGAACGCGCA GTTCTGCTCG ATGTGCGGGC CGAAGTTCTG CTCGATGCGG
ATCAGTCGTG ACATCAACGA GGCCTACGGC GGGCAGATGG CCGCCGAGGC GGGTGCCGCG
GCCGTCGGCG AGCCGGTGTT CGTCGAGTTG AGCACCAGGC CGTGA
 
Protein sequence
MTVTDGEVTV PMTAISQHDS PDGSPNEDFV VYRTMGPGSD PQVGLEPVRQ AWIEARGDTA 
THAGRERDLA DDGRSALRRG EPSQAWRGRV QEPRRAQPGR TVTQLHYARR GEITPEMRYV
ALREGCEVEL VRSEVAAGRA IIPANVNHPE SEPMVIGRAF LTKVNANIGN SAVTSSIAEE
VEKMSWATRW GADTVMDLST GEDIHTTREW ILRNSPVPIG TVPIYQALEK VDGVAEELTW
EVFRDTVVEQ AEQGVDYMTV HAGVRRRHVP LTAGRVTGIV SRGGSIMAGW CMAHRRESFL
YEHFDELCEI FAAHDVAFSL GDGLRPGSLA DANDAAQLAE LRTLGELTQR AWEHDVQVMV
EGPGHVPLHL VKENVDLEQE WCHGAPFYTL GPLATDIAPG YDHITSAIGA APIAQHGTAM
LCYVTPKEHL GLPNRDDVKT GVITYKIAAH AADVAKGHPR ARDWDDAMSK ARFEFRWHDQ
FALALDPVTA QEFHDETLPA EPAKNAQFCS MCGPKFCSMR ISRDINEAYG GQMAAEAGAA
AVGEPVFVEL STRP