Gene Smed_5539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5539 
Symbol 
ID5319841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp503715 
End bp504923 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content59% 
IMG OID640777290 
Producthypothetical protein 
Protein accessionYP_001314222 
Protein GI150377627 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.188295 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0555001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCGAC GAGTGGCCGA TCAACTCACA ATCGTTCTCG CAGGATCCGC TTCTCTTGCT 
ATCGCAATGG GTGTTGGTCG TTTTGCGTTC ACCCCAATCT TGCCGATGAT GCTGCACGAC
GGAACGGTCG ATCTCACTGT TGCCGGAAGG CTCGCAACCG CAAACTACGT CGGGTACCTC
GTGGGAGCGT TGGCGGCGAT GCTAATACCG AAGGAGTGGC CGCAGACGAC GATTATCAAA
TCGGCTCTTC TCCTGACCGT CGTGCTGACT GCCTTGATGG CGCTTCCGTT CCCAACGACA
TGGACAGGAC TACGATTTCT CGCAGGCGTA GCATCGGCAA TCGGCTTTGT ATTCACGTCA
GGTTGGTGCC TGGCGGAGCT TACCGGTTCA ACCGCCTCGA TCGGCAGTGC GATCTTTACC
GGGCCCGGAG CGGGGATCGC CGTTTCTGGT CTAGCAGCCA GCGTGATGAC AGCATTGAGC
TTCACTGGGC AGGGGGCCTG GACGACATTT GCCGTGATGT CGGCGGCGAT CAGCGCACTT
ATCTGGCGTG TCTTTCGAAA GCCCGGCAGA GACGTGGAGC CATTTTATGT TGGTAAAGCG
AGGACGTCGG GTCGGGTCCC GCGAACCGAA ATGCCCATGT TCGCCGTTGC GTACGGGCTC
GCTGGCTTCG GTTACATCGT GACTGCTACA TACCTGCCTG TGATCGCGAA GAACGCAATA
CCGGATTCCC CTTTGCTGAC CGTCTTCTGG CCGCTGTTTG GCATTTCAGC CGTAGTGGGT
TCACTTCTGG CATCGCGTGT GCGGAAGAGC GCCGACGCTC GCCTATACCT CATTGGTTCG
TATCTCGTGC AAGCGGCAGG CGTAGCAATG TCTGTCGTGT GGGAGGACGC AGTCGGTCTT
GCCTTAAGCA GCATTCTAGT CGGTGTCCCG TTCACCGCGA TCAGCTATTT TGCCATGAAT
GAGGTCCGAC GGATCCGATC AAGTCATCAC GCCCGCTACA TGGGGCTGCT GACGGCGGTG
TTCGCCGTCG GCCAGATCAT GGGGCCGCCG GTCGTAGGCG CAATATTCGC CCGCCAGGTG
AACTCCGACA GTGCGTTTGC TCTCGCCCTA GGCATTGCAA GCATAACGTT GGTCGTAGGG
GCGGTGATCC TGGTGGCGAT GATCGTGATG TTCCCGGCCA GGGTTCATCA CAGGCCTCAG
GAGCGCTAG
 
Protein sequence
MSRRVADQLT IVLAGSASLA IAMGVGRFAF TPILPMMLHD GTVDLTVAGR LATANYVGYL 
VGALAAMLIP KEWPQTTIIK SALLLTVVLT ALMALPFPTT WTGLRFLAGV ASAIGFVFTS
GWCLAELTGS TASIGSAIFT GPGAGIAVSG LAASVMTALS FTGQGAWTTF AVMSAAISAL
IWRVFRKPGR DVEPFYVGKA RTSGRVPRTE MPMFAVAYGL AGFGYIVTAT YLPVIAKNAI
PDSPLLTVFW PLFGISAVVG SLLASRVRKS ADARLYLIGS YLVQAAGVAM SVVWEDAVGL
ALSSILVGVP FTAISYFAMN EVRRIRSSHH ARYMGLLTAV FAVGQIMGPP VVGAIFARQV
NSDSAFALAL GIASITLVVG AVILVAMIVM FPARVHHRPQ ER