Gene Smed_4723 details

Gene Information

Locus tag: Smed_4723
Symbol:
ID: 5318903
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1245940
End bp: 1247010
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 62%
IMG OID: 640776521
Product: dTDP-glucose 4,6-dehydratase
Protein accession: YP_001313453
Protein GI: 150376857
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1088] dTDP-D-glucose 4,6-dehydratase
TIGRFAM ID: [TIGR01181] dTDP-glucose 4,6-dehydratase
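
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and rests on two assumptions the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length counts the stop codon while Protein Length does not. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Gene Information fields for Smed_4723.
length = gene_length_bp(1245940, 1247010)
print(length, protein_length_aa(length))  # prints: 1071 356
```

Under those assumptions the computed values agree with the listed Gene Length (1071 bp) and Protein Length (356 aa).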


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00234117
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGAGTCT TGGTAACGGG TGGCGCGGGT TTCATTGGTT CGGCGGTCTG CCGGCATCTG 
ATCCGATGCG GGGCGGAGCG CGTGGTCAAT GTTGACAAAC TGACCTATGC CGGCAGCCTT
GCGTCATTGC GAGCGGTGGA AGGCGACCCG CGCTATGCCT TCTACCGTGC CGACATTCTC
GACGAAAAGG TCCTGCTGCA GATCATGCGC CGGGAGCGCG TGGACGCGAT CATGCATCTT
GCTGCCGAGA GCCATGTCGA CCGTTCTATC GAAGGCCCCG ACCTTTTCAT GGAGACGAAC
GTCCTCGGCA CGGTACGGCT GCTTAACGCG GCACTCGCCT ATTGGTCCGG GCTAGGCCTC
GAAGGGCAGG AGCGTTTCCG CTTCCATCAT GTCTCGACCG ACGAGGTCTT CGGCGACCTC
CCGTTCAACC GGGGCATCTT CTCCGAGGAG AGCCGTTACG CGCCCTCGTC GCCTTACGCG
GCTTCAAAGG CGGCGGCGGA TCATTTTGCG CGCGCCTGGC ATCGCACCTA CGGCTTGCCG
GTGGTCGTCT CGAATTGCTC CAACAATTAC GGCCCATTCC ATTTCCCGGA AAAGCTGATC
CCGCTGACGA TCATCAATGC CATAGAGGAA GAGCCTTTGC CGCTCTATGG CTCCGGAGCG
AATGTCCGCG ATTGGCTCCA TGTGGACGAT CACGCAGCTG CCCTGGACCT GGTCATAAGC
CAGGGCAGAC CAGGGGAAAG CTATAATATC GGCGCCCGTG CCGAGCGCAA TAATCTCTCG
GTCATGGAGA GCATCTGCGA TCTCATAGAC ATGAAATTGC CGCGCAAGGG CGGCGGCAGC
TACAGGGACC TCATCACCCT TGTCCCCGAC CGTCCCGGTC ACGACCGGCG CTATGCGATC
GATCCTTCGA AGGTCGAGCG TGAGCTCGGC TGGAGACCGA AGCGGAGCTT CGAGGCGGGA
TTGAGCGAGA CGGTCGACTG GTTCCTCGCA AACCGCTGGT GGTGGGAGCC GATCCGACGT
GAACGCCATT CGGGATCCCG CATCGGCGGG CTGCATCGGA GCGTGGCGTG A
 
Protein sequence
MRVLVTGGAG FIGSAVCRHL IRCGAERVVN VDKLTYAGSL ASLRAVEGDP RYAFYRADIL 
DEKVLLQIMR RERVDAIMHL AAESHVDRSI EGPDLFMETN VLGTVRLLNA ALAYWSGLGL
EGQERFRFHH VSTDEVFGDL PFNRGIFSEE SRYAPSSPYA ASKAAADHFA RAWHRTYGLP
VVVSNCSNNY GPFHFPEKLI PLTIINAIEE EPLPLYGSGA NVRDWLHVDD HAAALDLVIS
QGRPGESYNI GARAERNNLS VMESICDLID MKLPRKGGGS YRDLITLVPD RPGHDRRYAI
DPSKVERELG WRPKRSFEAG LSETVDWFLA NRWWWEPIRR ERHSGSRIGG LHRSVA