Gene Information |
Locus tag | Ksed_19710 |
Symbol | |
ID | 8373476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Kytococcus sedentarius DSM 20547 |
Kingdom | Bacteria |
Replicon accession | NC_013169 |
Strand | - |
Start bp | 2055895 |
End bp | 2057559 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644992225 |
Product | hydroxymethylpyrimidine synthase |
Protein accession | YP_003149735 |
Protein GI | 256825775 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
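The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2055895
end_bp = 2057559

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1665 554
```

Both values agree with the "Gene Length" and "Protein Length" fields, which supports the assumed conventions.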
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 0.19694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00821817 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCGTGA CCGACGGGGA GGTGACCGTG CCGATGACGG CGATCAGCCA GCACGACTCC CCGGACGGCA GCCCCAACGA GGACTTCGTC GTGTACCGCA CGATGGGGCC GGGGTCCGAC CCGCAGGTGG GCCTGGAACC GGTGCGGCAG GCGTGGATCG AGGCGCGCGG CGACACCGCG ACCCATGCCG GGCGGGAGCG TGACCTCGCC GACGACGGGC GCTCGGCCCT GCGGCGCGGG GAGCCCTCGC AGGCGTGGCG GGGTCGGGTG CAGGAGCCGC GGCGCGCGCA GCCGGGCCGG ACGGTGACGC AGCTGCACTA CGCCCGCCGG GGCGAGATCA CCCCCGAGAT GCGGTACGTG GCCCTGCGCG AGGGCTGCGA GGTGGAGCTG GTGCGCTCGG AGGTGGCGGC GGGGCGCGCC ATCATTCCGG CGAACGTGAA CCACCCCGAG TCCGAGCCGA TGGTCATCGG CCGGGCCTTC CTCACCAAGG TGAACGCCAA CATCGGCAAC TCGGCCGTCA CCTCCTCGAT CGCCGAGGAG GTGGAGAAGA TGTCCTGGGC CACCCGCTGG GGCGCCGACA CGGTGATGGA TCTCTCCACC GGGGAGGACA TCCACACCAC GCGGGAGTGG ATCCTGCGCA ACTCCCCGGT GCCGATCGGC ACGGTCCCCA TCTACCAGGC CCTGGAGAAG GTCGACGGCG TGGCCGAGGA GCTCACATGG GAGGTCTTCC GCGACACGGT TGTCGAGCAG GCAGAGCAGG GCGTGGACTA CATGACCGTG CACGCCGGGG TGCGGCGGCG GCACGTGCCT CTGACGGCCG GGCGTGTGAC CGGCATCGTC TCCCGCGGCG GGTCGATCAT GGCCGGGTGG TGCATGGCCC ACCGCCGCGA GTCGTTCCTG TACGAGCACT TCGACGAGCT GTGTGAGATC TTCGCCGCGC ACGACGTCGC CTTCTCGCTG GGCGACGGGC TGCGGCCGGG GTCGCTGGCC GACGCGAACG ACGCCGCCCA GCTGGCCGAG CTGCGCACCC TGGGCGAGCT GACGCAGCGG GCCTGGGAGC ACGACGTGCA GGTGATGGTG GAGGGCCCCG GCCACGTGCC GCTGCACCTG GTCAAGGAGA ACGTGGACCT GGAGCAGGAG TGGTGCCACG GGGCGCCCTT CTACACGCTG GGCCCGCTGG CCACCGACAT CGCGCCGGGC TACGACCACA TCACCAGTGC CATCGGGGCG GCCCCCATCG CCCAACACGG CACGGCGATG CTCTGCTACG TCACGCCGAA GGAGCACCTG GGCCTGCCGA ACCGGGACGA CGTGAAGACC GGGGTGATCA CCTACAAGAT CGCCGCGCAC GCCGCGGACG TCGCCAAGGG CCACCCGCGG GCCCGGGACT GGGACGACGC CATGAGTAAG GCGCGCTTCG AGTTCCGTTG GCACGACCAG TTCGCGCTGG CGCTGGACCC GGTGACCGCG CAGGAGTTCC ACGACGAGAC GCTGCCGGCC GAGCCGGCCA AGAACGCGCA GTTCTGCTCG ATGTGCGGGC CGAAGTTCTG CTCGATGCGG ATCAGTCGTG ACATCAACGA GGCCTACGGC GGGCAGATGG CCGCCGAGGC GGGTGCCGCG GCCGTCGGCG AGCCGGTGTT CGTCGAGTTG AGCACCAGGC CGTGA
|
Protein sequence | MTVTDGEVTV PMTAISQHDS PDGSPNEDFV VYRTMGPGSD PQVGLEPVRQ AWIEARGDTA THAGRERDLA DDGRSALRRG EPSQAWRGRV QEPRRAQPGR TVTQLHYARR GEITPEMRYV ALREGCEVEL VRSEVAAGRA IIPANVNHPE SEPMVIGRAF LTKVNANIGN SAVTSSIAEE VEKMSWATRW GADTVMDLST GEDIHTTREW ILRNSPVPIG TVPIYQALEK VDGVAEELTW EVFRDTVVEQ AEQGVDYMTV HAGVRRRHVP LTAGRVTGIV SRGGSIMAGW CMAHRRESFL YEHFDELCEI FAAHDVAFSL GDGLRPGSLA DANDAAQLAE LRTLGELTQR AWEHDVQVMV EGPGHVPLHL VKENVDLEQE WCHGAPFYTL GPLATDIAPG YDHITSAIGA APIAQHGTAM LCYVTPKEHL GLPNRDDVKT GVITYKIAAH AADVAKGHPR ARDWDDAMSK ARFEFRWHDQ FALALDPVTA QEFHDETLPA EPAKNAQFCS MCGPKFCSMR ISRDINEAYG GQMAAEAGAA AVGEPVFVEL STRP
|
| |
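The gene and protein sequences above can be related with a small translation sketch. It is a minimal illustration, not the pipeline that produced this record. The function names are hypothetical. The sketch assumes the gene sequence is given 5' to 3' on the coding strand, and that translation table 11 assigns amino acids as the standard code does while also allowing alternative start codons such as GTG, which are read as methionine in the first position.

```python
# Codon table in the conventional TCAG ordering (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Drop the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
first_blocks = "GTGACCGTGA CCGACGGGGA GGTGACCGTG"
print(translate_cds(first_blocks))  # MTVTDGEVTV
print(gc_fraction(first_blocks))
```

The translated prefix matches the first block of the protein sequence, including the GTG start codon read as M. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the "GC content" field.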