Gene Smed_2983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2983 
Symbol 
ID5323860 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp3130780 
End bp3132213 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content62% 
IMG OID640791934 
Productdihydropyrimidinase 
Protein accessionYP_001328647 
Protein GI150398180 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0044] Dihydroorotase and related cyclic amidohydrolases 
TIGRFAM ID[TIGR00857] dihydroorotase, multifunctional complex type
[TIGR02033] D-hydantoinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTCG ATAAGGTCAT CCGTTATGGA ACGGTCGTCA CCGCCAGCGA TACATTCAAG 
AGCGATGTCG GCATTAAGGA AGGCCGCATC GCGGCGCTTG CGGGGGCGTT GACGGAGGCG
GACGAGATCA TCGACGCAAC CGGCCTCTAC GTCATGCCCG GCGGCATAGA CAGCCATGTT
CATCTGGACC AGCCCTCCGG TGAGGGCATC GTCATGGCCG ACGACTTCGA AAGCGGTACC
CGTTCGGCGG CGATCGGCGG AAACAGCACG GTTCTGACCT TCTGCATGCA GGAGAAGGGC
CAGAGCCTTC GCGAGGCACT GAGACTTTAC CACGCCAAAG CGGAGGGACG CTGCCACATC
GACGTATCCT TCCACCTGGT CGTCACCGAT CCTACGCCCG AGGTGCTCGG GCAGGAACTG
CCGGCGCTCG TCGCCGACGG CTACACCTCG ATCAAGGTCT TCATGACCTA TGACGGTCTG
CGGCTGCGCG ACGACGAAAT CCTCGCCACG CTCGACGCGG CGCGCCGAAC CGGCGCATTG
GTCATGGTTC ACTGCGAGAA CGAGGACGCG ATCCGCTATC TGATCGGGCG CCACGAGGCC
GAAGGACAGG TCGAACCCAG ATACCATGCA GCCACTCGGC CGATTGCCGC CGAGCGCGAG
GCAACGCACC GGGCTTTGTC GCTCGCCGAG ATTGTCGACA CGCCTGTGGT CATCGTTCAT
GTTTCCAATC GTCAGGCGAT GGAAGAGATC AGGCGGGCGC GGCAACGCGG CCAGAAGATC
GCCGGCGAAA CATGCCCGCA ATATCTGATG CTGACTTCCG AGGATCTGGA TGCCGACGCC
CTTGAAGGAG CGAAATATGT CTGCTCGCCG CCGCCTCGGG ACAAGGAAAG CCAATCGGCA
TGCTGGGAGG GCATTGAGCA AGGCGTGTTC GATCTCTTTT CTTCGGATCA CTGCCCGTTT
CGCTTCGACG ATCCCGCAGG CAAGCTGAAC GAGAAAGGCA GACGGCACTT CCGCTGGATA
CCCAACGGCA TCCCGGGTGT CGCTACGCGG CTGCCTATCC TCTTCTCGGA AGGCGTGATG
AAGGGACGGA TCGATCTGAA TCGCTTCGTC GCGGTAACAT CGACCAACCA CGCAAAGCTC
TACGGACTCT ATCCGCGCAA GGGCACGATC GCGATCGGAG CCGATGCCGA CATCGCACTC
TGGGATCCGG ATATGCAAAT CACGCTGACG AACGAGATGC TTCGGCATGG GGCCGATTAT
ACGCCCTATG AGGGCCTCGC CGTTCGCGGC TGGCCGGTTC GGGTCATGGT CCGAGGCACC
ACGGTCGCCA AAGACGGGCA TCTGTGTGAT AGCCGACGCG GCGGATCCTA TCTGCACAGG
ACGCTCTCCT CGTTGACGAG GAATGCAGGC AACGGGAACC GTAACCGCGG TTAG
 
Protein sequence
MRFDKVIRYG TVVTASDTFK SDVGIKEGRI AALAGALTEA DEIIDATGLY VMPGGIDSHV 
HLDQPSGEGI VMADDFESGT RSAAIGGNST VLTFCMQEKG QSLREALRLY HAKAEGRCHI
DVSFHLVVTD PTPEVLGQEL PALVADGYTS IKVFMTYDGL RLRDDEILAT LDAARRTGAL
VMVHCENEDA IRYLIGRHEA EGQVEPRYHA ATRPIAAERE ATHRALSLAE IVDTPVVIVH
VSNRQAMEEI RRARQRGQKI AGETCPQYLM LTSEDLDADA LEGAKYVCSP PPRDKESQSA
CWEGIEQGVF DLFSSDHCPF RFDDPAGKLN EKGRRHFRWI PNGIPGVATR LPILFSEGVM
KGRIDLNRFV AVTSTNHAKL YGLYPRKGTI AIGADADIAL WDPDMQITLT NEMLRHGADY
TPYEGLAVRG WPVRVMVRGT TVAKDGHLCD SRRGGSYLHR TLSSLTRNAG NGNRNRG