Gene Smed_5234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5234 
Symbol 
ID5319536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp193726 
End bp194814 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content60% 
IMG OID640777011 
Producthypothetical protein 
Protein accessionYP_001313943 
Protein GI150377348 
COG category[R] General function prediction only 
COG ID[COG0628] Predicted permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATCAA GCTATGTCCC GGAGCTGCCA AATGCTGCCG ACAACCCTGG GAGGTGCACG 
ATGACCGAAC GCGGTAGGGT GGTCGTACCG GGCATCCTGA CTGTAATCGC CACGGTGGCG
GCCGTGTATT TCTCCGGTGT CGTTCTCGCA CCTGTCGCAT GCGCGCTGTT TATCATTGCG
GTGCTGTGGC CAATTCAAAG TCGGCTAGAG GCACGCCTGC ACAGGGTATT CGCATTGGCC
ATCGTCGCGG CGCTGCTGTT CGGGACCTTT GTCGTGTTCA TGTCCGTCGT GACGTTGAGC
TTCGGCCGGA TCGGGCGGTC GCTGGCAATG GATGCCGGTC AGTTTCAGCT GCTTTACAAT
CGCCTTGCGG AGTGGCTAGG GGGGCACGGA ATAGCTCTCG CCGGGTTCTG GGCGGACAAC
CTCGACTCCC GGCTCCTGTT GCGCGCCCTG CAGGGGATCT CTGCTCGGCT GAACACCATG
GTCTCGTTCT GGCTCGTGGT TCTCCTCTAT GTCATTCTCG GGCTTTTGGA AGTGTCGGAC
CTGGGTGCCA GAATCCGCAG GCTCACGGAC GACAACGCGG CTCGCATCAT CAGCGGCTTC
GAACTCGCGG CCTCGCGTAT CCGACGATAT CTTCTCATCA GGACGATCAT GAGCGCGGCG
ACCGGAATTG CCGTCTGGGC GCTTGCCACG GCCTTCGGAC TGCGGTTCGC TGCCGAATGG
GGAATCGTCG CGTTCACCTT GAACTATATC CCGTTCATCG GCCCGGCATT CGCGACGATC
CTGCCGACCT GTTACGCCCT GGCTCAATTC CAGTCGCCTC AGTCGGCCCT GATCGTCTTC
GCCTGCTTGA GTACCGCCCA ATTCATAATA GGGAGCTACA TCGAGCCCAG AGTGGCTGGT
AACACACTCG GCATATCTCC GTCGCTCGTC CTCTTCTCCG TTTTTCTCTG GACCTTCCTG
TGGGGAATAT TCGGCGCGTT TATCGGCGTT CCGATAACCA TCGCCGTCCT TTCATTCTGC
GCCCAGTTCG CCTCTACGCG GTGGCTTGCG GAACTGCTGG GGAAAGAGAC GATAATGGAT
GGCGCATGA
 
Protein sequence
MQSSYVPELP NAADNPGRCT MTERGRVVVP GILTVIATVA AVYFSGVVLA PVACALFIIA 
VLWPIQSRLE ARLHRVFALA IVAALLFGTF VVFMSVVTLS FGRIGRSLAM DAGQFQLLYN
RLAEWLGGHG IALAGFWADN LDSRLLLRAL QGISARLNTM VSFWLVVLLY VILGLLEVSD
LGARIRRLTD DNAARIISGF ELAASRIRRY LLIRTIMSAA TGIAVWALAT AFGLRFAAEW
GIVAFTLNYI PFIGPAFATI LPTCYALAQF QSPQSALIVF ACLSTAQFII GSYIEPRVAG
NTLGISPSLV LFSVFLWTFL WGIFGAFIGV PITIAVLSFC AQFASTRWLA ELLGKETIMD
GA