Gene Smed_4174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4174 
Symbol 
ID5318583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp647437 
End bp649269 
Gene Length1833 bp 
Protein Length610 aa 
Translation table11 
GC content63% 
IMG OID640775979 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001312912 
Protein GI150376316 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.772115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0881714 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTAT TCAAGCCTCT CGGCGCGACC CCTTCGGTCA CCCAAGGGCC GCTCCCCGCG 
TCCCAAAAGG TCTACAGGCC GGGCGATATC CATCCCGACA TCCGCGTGCC GATGCGCGAG
ATCACCCTGC ATCCGACCTC GGGCGAACCG CCGGTGACCG TATACGACGC CTCCGGACCC
TACACGCTTG AGAATGCCGA TATCCGCATC GACGCAGGAC TGCCGCGTCT GCGCCACGAC
TGGATCCTGA AGCGCGGCGG CGTGGAAAGT TACGAGGGCC GAGCGGTCAA GGCGGAAGAC
AACGGCTTTG CGAGCGGCGA GCGCCTCACG CCGGAATTCC CGGTGCGCAA TCAGCCGCTT
CGCGCCAAGC CGGGACAAGC GGTCACCCAG ATCGCCTTTG CCCGCGCCGG CATCATCACA
CCGGAAATGG AGTTCATCGC CATCCGCGAA AATCTCGGCC GCAAAGCCGC GAAGGAGGTA
CTCACGCGCG ATGGCGAGAG CTTCGGCGCA TCGGTCCCCG ATTTCGTCAC GCCGGAATTC
GTCCGCCAGG AGGTGGCCGC CGGCCGGGCC ATCATTCCGG CCAACATCAA CCATCCGGAG
TCGGAGCCGA TGATCATCGG CCGCAACTTT CTGGTGAAGA TCAACGCGAA TATCGGCAAT
TCCGCCGTGA CTTCCTCGAT GGCCGAGGAA GTCGAGAAGA TGGTATGGGC GACCCGCTGG
GGCGCCGATA CGGTCATGGA CCTTTCCACC GGCCGCAACA TCCACAATAT CCGCGAATGG
ATCATCCGCA ATTCGCCGGT GCCGATCGGC ACGGTACCGC TTTACCAGGC ACTGGAAAAG
GTGAACGGCA TTGCCGAGGA CCTTTCCTGG GAGGTCTTCC GCGACACGCT AATCGAACAG
GCGGAACAGG GCGTAGACTA TTTCACCATT CATGCCGGCG TCAGGCTCCA CTATATCCCG
CTCACCGTCG ATCGAGTCAC CGGCATCGTC TCGCGCGGCG GATCGATCAT GGCCAAGTGG
TGTCTGCATC ACCACCAGGA GAGCTTCCTC TACGAGCATT TCGAGGAGAT CTGCGACATC
TGCCGCGCCT ATGACGTTTC CTTTTCGCTG GGCGACGGTC TGCGGCCCGG TTCTATCGCC
GACGCCAACG ACCGGGCGCA GTTCGCCGAA CTCGAAACGC TCGGCGAGTT GACGCAGATC
GCCTGGGCCA GGGATTGCCA GGTGATGATC GAAGGCCCCG GCCACGTGCC GATGCACAAG
ATCAAGGAGA ACATGGACAA GCAGCTTGCC GTTTGCGGCG AAGCGCCTTT CTACACGCTG
GGGCCGCTGA CCACCGATAT CGCGCCGGGC TACGACCACA TCACCTCCGG CATCGGCGCG
GCGATGATCG GCTGGTTCGG CACCGCGATG CTCTGCTACG TGACGCCGAA GGAGCATCTG
GGCCTTCCCG ACCGCAACGA CGTCAAGATG GGCGTCATCA CCTACAAGAT CGCCGCGCAT
GCCGCCGATC TCGCCAAGGG GCATCCGGCC GCGCGCATCC GTGACGATGC ACTGTCGCGC
GCGCGCTTCG AGTTCCGCTG GGAGGATCAG TTCAATCTCT CGCTCGACCC GGAGACGGCG
CGGAACTTCC ATGACGAAAC GCTGCCGAAG GAGGCGCACA AGGTCGCGCA TTTCTGCTCC
ATGTGCGGTC CGAAATTCTG TTCGATGCGG ATTTCCCACG ACATCCGTGC CGAGGCGCAG
AAGGAGGGCC TTGAGGCGAT GGCAGCGAAA TACCGCGACG GCGGCGATCT CTATATGCCG
GTCGATGGGA GCGATGCGCC CACTGGAGAA TGA
 
Protein sequence
MNLFKPLGAT PSVTQGPLPA SQKVYRPGDI HPDIRVPMRE ITLHPTSGEP PVTVYDASGP 
YTLENADIRI DAGLPRLRHD WILKRGGVES YEGRAVKAED NGFASGERLT PEFPVRNQPL
RAKPGQAVTQ IAFARAGIIT PEMEFIAIRE NLGRKAAKEV LTRDGESFGA SVPDFVTPEF
VRQEVAAGRA IIPANINHPE SEPMIIGRNF LVKINANIGN SAVTSSMAEE VEKMVWATRW
GADTVMDLST GRNIHNIREW IIRNSPVPIG TVPLYQALEK VNGIAEDLSW EVFRDTLIEQ
AEQGVDYFTI HAGVRLHYIP LTVDRVTGIV SRGGSIMAKW CLHHHQESFL YEHFEEICDI
CRAYDVSFSL GDGLRPGSIA DANDRAQFAE LETLGELTQI AWARDCQVMI EGPGHVPMHK
IKENMDKQLA VCGEAPFYTL GPLTTDIAPG YDHITSGIGA AMIGWFGTAM LCYVTPKEHL
GLPDRNDVKM GVITYKIAAH AADLAKGHPA ARIRDDALSR ARFEFRWEDQ FNLSLDPETA
RNFHDETLPK EAHKVAHFCS MCGPKFCSMR ISHDIRAEAQ KEGLEAMAAK YRDGGDLYMP
VDGSDAPTGE