Gene Smed_5154 details


Gene Information

Locus tag: Smed_5154
Symbol:
ID: 5319456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 107886
End bp: 109877
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 60%
IMG OID: 640776932
Product: hypothetical protein
Protein accession: YP_001313864
Protein GI: 150377269
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
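
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and not part of the record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes one terminal stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length(107886, 109877)
print(length)                  # 1992, matching "Gene Length"
print(protein_length(length))  # 663, matching "Protein Length"
```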


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGAAGT TCTCGCGTGC CAATGCTCCA GATCGTGCCG CAGCCGCCAG CGGAATGCAG 
CAGATCGCGC TTGCGCGCGG CACGGCAGCG GCAGCACAGC GGTCACGTCT GCCAACCTTC
GTCGCAACCG CAGCCACGGT TGCAGTGCTC TATTTCGCCC GTGATGTGTT TCTGCCTCTG
GCGATCGCCA TTTTGCTCAC CTTCGCTCTC GCGCCGCTGG TTTCGAAACT GCGCAGGGTC
GGTTGTCCCC GCTCAGTGGC GGTTGTCGGC ACGGTAACAA TCGCCTTTCT GTTCCTGTTC
GCATTCAGTG GCGTTGTCGC AATGCAAGTC AGCGAGGTCG CGCAAAATCT GCCGACCTAT
CAATACAACA TTGTCGAGAA GGTCAGGAGT TTGAAGGAGA CAGGGTCGGA GAGCCAGATC
CTCGACCGCC TTGGTCGCGT GATGGAGAGG ATCAACATAG AAATCAGCCG GCCCGAACCC
GAGCTTCGCG CCTCACCCGA AGCCGAACCT GAGAGAAAGC CCGTTCTTGT GGAGATTTTT
TCTCCACAGC GACCCGTCGA GACATTGAAG AATATCATTA ACCCGCTGCT TGGCCCCTTG
GCAACGACTG GCCTGGTCCT CGTCGTGGTT ATCTTCATGC TCTTCGAGAG GGAGGACTTG
CGGGACCGTT TCATTCGGTT GGTCGGCTAC GGCGATCTGC ATCGAACGAC CGAGGCACTT
CAGGACGCGG GTGCCCGGGT TGGCCGCTAT CTGCTCATGC AACTGGTCGT GAACATCACC
TACGGCATTC CGCTGGCGAT AGGCCTCTCG CTGCTAGGCA TTCCCAATGC CGTTCTCTGG
GGGATGCTGG CCATCCTACT GCGCTTCGTC CCCTATATCG GACCGGTGAT CGCCGCGGCG
CTGCCATTGT TCCTCGCTTT TGCCGCGGCA CCGGGATGGA GCTTGCTCAT TTGGACGGCT
GCCCTTTTCA TCGTTCTCGA ACTTCTCAGC AATAATGTGG TCGAGCCTTG GCTGTATGGT
TCACGCACCG GTCTGTCTCC GCTTGCGATC ATCGTCTCGG CGATCTTCTG GGCATGGCTC
TGGGGGCCGG TGGGATTGGT GCTGTCGACA CCTCTGACCG TGTGCCTCGT AGTCCTCGGA
CGGCACGTCC CACAGTTCGA GTTCCTGGAG ATTCTGCTGG GCAACGAGCC CGTTCTCGAT
CCGAAGGAGC GCCTCTACCA GCGCCTGCTC GCCGGTGATC CGGATGAGGC GACGGACAAC
GCCGAAGATA TGCTTGAGGA GAAATATCTC GTCGAGTTCT ACGACACCGT TGCGATCCCC
GCATTGCTCC TGGCCGAGCG GGACCGCGCA CGGGGCGCCT TGACGAGCGC CCAGGCAGCT
CAGATTGCCC AGAGTGCGAA CACACTTATT GCCAATCTCG AGGAGATTGC CGGCGAAGAG
GAGGGGGAGG AGGAAACGAG CACGGAAGAT CAGGAAAGTG ACGGGGACGG GGACATGGAT
GAATATGATC TTCCCACGGG AGACGGGAAA TCTGTCCTTT GCGTCGGGGG GCGTGGTGAC
CTCGACGATG TTGCCGCTTC AATGCTCGCC CAGACCCTTT GGATCCAGGG CGCGGATGCT
GCGCAGGCCG GCCACGAAGT TCTCAAGGCG GGCAATATAA AAGGTCTGAA GCTTGAAGGG
CGCAACGCAG TCGTCCTAAG CGTTCTCGAT CAGGATTTTA TGCGGCATGC CAAATTTACC
GTGCGCCGGC TGAAACGTAT GGCGCCTGCG GCGCGTATCG GGATCGTCCT GTGGCAGGAG
AACGGCCGGC CAGGAGCAAC CGAGCGAGAC CAACTTATTG AATCGATGCA GGCGGATTTC
GTCGTATTCG GGATGGGGGA CGCCGTCCGC GAGGCGCTCT CCGACGCCGC TCCTCGTCCG
TTGAAACTCG CCCATCCAAA GATCGCACCT GGATACGCCA TGCGGCGAAG CAAGCGCGTC
GAGAGATCAT AA
 
Protein sequence
MLKFSRANAP DRAAAASGMQ QIALARGTAA AAQRSRLPTF VATAATVAVL YFARDVFLPL 
AIAILLTFAL APLVSKLRRV GCPRSVAVVG TVTIAFLFLF AFSGVVAMQV SEVAQNLPTY
QYNIVEKVRS LKETGSESQI LDRLGRVMER INIEISRPEP ELRASPEAEP ERKPVLVEIF
SPQRPVETLK NIINPLLGPL ATTGLVLVVV IFMLFEREDL RDRFIRLVGY GDLHRTTEAL
QDAGARVGRY LLMQLVVNIT YGIPLAIGLS LLGIPNAVLW GMLAILLRFV PYIGPVIAAA
LPLFLAFAAA PGWSLLIWTA ALFIVLELLS NNVVEPWLYG SRTGLSPLAI IVSAIFWAWL
WGPVGLVLST PLTVCLVVLG RHVPQFEFLE ILLGNEPVLD PKERLYQRLL AGDPDEATDN
AEDMLEEKYL VEFYDTVAIP ALLLAERDRA RGALTSAQAA QIAQSANTLI ANLEEIAGEE
EGEEETSTED QESDGDGDMD EYDLPTGDGK SVLCVGGRGD LDDVAASMLA QTLWIQGADA
AQAGHEVLKA GNIKGLKLEG RNAVVLSVLD QDFMRHAKFT VRRLKRMAPA ARIGIVLWQE
NGRPGATERD QLIESMQADF VVFGMGDAVR EALSDAAPRP LKLAHPKIAP GYAMRRSKRV
ERS
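
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiator codon and is then read as methionine. The sketch below shows one way to reproduce the translation from the space-separated blocks shown above. It is a minimal illustration, not the tool that produced this record: the `translate_cds` name is hypothetical, and the start-codon set is limited to three common initiators (ATG, GTG, TTG), which is an assumption and only a subset of those allowed by table 11.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G). Translation table 11
# assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only these common initiators are treated as start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping a terminal stop."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    # An initiator codon is read as methionine regardless of its usual meaning.
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First line of the gene sequence above.
first_line = "GTGCTGAAGT TCTCGCGTGC CAATGCTCCA GATCGTGCCG CAGCCGCCAG CGGAATGCAG"
print(translate_cds(first_line))  # MLKFSRANAPDRAAAASGMQ
```

Passing the full gene sequence to the same function should yield the 663-residue protein listed above, provided all sequence lines are concatenated first.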