Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4174 |
Symbol | |
ID | 5318583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | - |
Start bp | 647437 |
End bp | 649269 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640775979 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001312912 |
Protein GI | 150376316 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.772115 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0881714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCTAT TCAAGCCTCT CGGCGCGACC CCTTCGGTCA CCCAAGGGCC GCTCCCCGCG TCCCAAAAGG TCTACAGGCC GGGCGATATC CATCCCGACA TCCGCGTGCC GATGCGCGAG ATCACCCTGC ATCCGACCTC GGGCGAACCG CCGGTGACCG TATACGACGC CTCCGGACCC TACACGCTTG AGAATGCCGA TATCCGCATC GACGCAGGAC TGCCGCGTCT GCGCCACGAC TGGATCCTGA AGCGCGGCGG CGTGGAAAGT TACGAGGGCC GAGCGGTCAA GGCGGAAGAC AACGGCTTTG CGAGCGGCGA GCGCCTCACG CCGGAATTCC CGGTGCGCAA TCAGCCGCTT CGCGCCAAGC CGGGACAAGC GGTCACCCAG ATCGCCTTTG CCCGCGCCGG CATCATCACA CCGGAAATGG AGTTCATCGC CATCCGCGAA AATCTCGGCC GCAAAGCCGC GAAGGAGGTA CTCACGCGCG ATGGCGAGAG CTTCGGCGCA TCGGTCCCCG ATTTCGTCAC GCCGGAATTC GTCCGCCAGG AGGTGGCCGC CGGCCGGGCC ATCATTCCGG CCAACATCAA CCATCCGGAG TCGGAGCCGA TGATCATCGG CCGCAACTTT CTGGTGAAGA TCAACGCGAA TATCGGCAAT TCCGCCGTGA CTTCCTCGAT GGCCGAGGAA GTCGAGAAGA TGGTATGGGC GACCCGCTGG GGCGCCGATA CGGTCATGGA CCTTTCCACC GGCCGCAACA TCCACAATAT CCGCGAATGG ATCATCCGCA ATTCGCCGGT GCCGATCGGC ACGGTACCGC TTTACCAGGC ACTGGAAAAG GTGAACGGCA TTGCCGAGGA CCTTTCCTGG GAGGTCTTCC GCGACACGCT AATCGAACAG GCGGAACAGG GCGTAGACTA TTTCACCATT CATGCCGGCG TCAGGCTCCA CTATATCCCG CTCACCGTCG ATCGAGTCAC CGGCATCGTC TCGCGCGGCG GATCGATCAT GGCCAAGTGG TGTCTGCATC ACCACCAGGA GAGCTTCCTC TACGAGCATT TCGAGGAGAT CTGCGACATC TGCCGCGCCT ATGACGTTTC CTTTTCGCTG GGCGACGGTC TGCGGCCCGG TTCTATCGCC GACGCCAACG ACCGGGCGCA GTTCGCCGAA CTCGAAACGC TCGGCGAGTT GACGCAGATC GCCTGGGCCA GGGATTGCCA GGTGATGATC GAAGGCCCCG GCCACGTGCC GATGCACAAG ATCAAGGAGA ACATGGACAA GCAGCTTGCC GTTTGCGGCG AAGCGCCTTT CTACACGCTG GGGCCGCTGA CCACCGATAT CGCGCCGGGC TACGACCACA TCACCTCCGG CATCGGCGCG GCGATGATCG GCTGGTTCGG CACCGCGATG CTCTGCTACG TGACGCCGAA GGAGCATCTG GGCCTTCCCG ACCGCAACGA CGTCAAGATG GGCGTCATCA CCTACAAGAT CGCCGCGCAT GCCGCCGATC TCGCCAAGGG GCATCCGGCC GCGCGCATCC GTGACGATGC ACTGTCGCGC GCGCGCTTCG AGTTCCGCTG GGAGGATCAG TTCAATCTCT CGCTCGACCC GGAGACGGCG CGGAACTTCC ATGACGAAAC GCTGCCGAAG GAGGCGCACA AGGTCGCGCA TTTCTGCTCC ATGTGCGGTC CGAAATTCTG TTCGATGCGG ATTTCCCACG ACATCCGTGC CGAGGCGCAG AAGGAGGGCC TTGAGGCGAT GGCAGCGAAA TACCGCGACG GCGGCGATCT CTATATGCCG GTCGATGGGA GCGATGCGCC CACTGGAGAA TGA
|
Protein sequence | MNLFKPLGAT PSVTQGPLPA SQKVYRPGDI HPDIRVPMRE ITLHPTSGEP PVTVYDASGP YTLENADIRI DAGLPRLRHD WILKRGGVES YEGRAVKAED NGFASGERLT PEFPVRNQPL RAKPGQAVTQ IAFARAGIIT PEMEFIAIRE NLGRKAAKEV LTRDGESFGA SVPDFVTPEF VRQEVAAGRA IIPANINHPE SEPMIIGRNF LVKINANIGN SAVTSSMAEE VEKMVWATRW GADTVMDLST GRNIHNIREW IIRNSPVPIG TVPLYQALEK VNGIAEDLSW EVFRDTLIEQ AEQGVDYFTI HAGVRLHYIP LTVDRVTGIV SRGGSIMAKW CLHHHQESFL YEHFEEICDI CRAYDVSFSL GDGLRPGSIA DANDRAQFAE LETLGELTQI AWARDCQVMI EGPGHVPMHK IKENMDKQLA VCGEAPFYTL GPLTTDIAPG YDHITSGIGA AMIGWFGTAM LCYVTPKEHL GLPDRNDVKM GVITYKIAAH AADLAKGHPA ARIRDDALSR ARFEFRWEDQ FNLSLDPETA RNFHDETLPK EAHKVAHFCS MCGPKFCSMR ISHDIRAEAQ KEGLEAMAAK YRDGGDLYMP VDGSDAPTGE
|
| |