Gene Smed_1518 details


Gene Information

Locus tag: Smed_1518
Symbol:
ID: 5322376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1599659
End bp: 1601659
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 63%
IMG OID: 640790465
Product: hypothetical protein
Protein accession: YP_001327197
Protein GI: 150396730
COG category: [S] Function unknown
COG ID: [COG1289] Predicted membrane protein
TIGRFAM ID:
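
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 1599659
end_bp = 1601659
gene_length_bp = 2001
protein_length_aa = 666

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that does not code for a residue.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 2001 bp gives 667 codons, one of which is the stop, leaving 666 residues.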


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTTCA GGCTTGGCTT GCGGGACTGG CTGCTGGCCA ACGACCCGGC CTTATCCCGG 
CTGAGGCAGG CGTCGCGCAT TACCGCGACG GTGGTGTTCT CAACCGCGCT GCTCGTTCTT
TTCCATTTCA TCGTCACATC GCTTCCGCCG GCTGCCTATG GGCTTGCGAT AACGCTGTCG
ATCGAAGGCG GTCTGGCAGT TCGCGACAGG ACCGCATCGG AACAATTGGT CACACGCATT
CTTGCGGTCG TCACCGCGGT CGCAATGGTC ACGCTCGCTT CGGCCCTGGA AGACCACCGG
CACATTTCAG ACTTTGTTTT TCTCGTCATC ATCTTCGTGG CCGTCTATGG CCGCGCCTTC
GGTCAACGCT GGTTCGCCGT CGGCATGTTC GCTTTCATGT CCTATTTCAC AGGAGCCTAT
CTGCGGCCGT CTCTCGACCA GTTGCCGGCG CTCGTTCTCG GCGCCGGAAT TTCGGCTGCC
GTCGCGCATC TGGTGCGGAC GGTCCTGCTC CCGGACGACC GGTACCGTGA CCTCTTGCGG
GCGATCGCGA GCGTCCAGCA GAGGGTGGAT GATATTCTCC TCGGGATCGT GGCCGCCGCG
CGGAAATCGA GGATTGCCGA TCGCCGCAGG CTGCACGCGC TCGAGGAGCG GCTGAAGGAA
TCCGTGCTGA TGGCGGAGAG CTTTATCCCC ATGGACAGCA GCCGCCCGGC ACCGGAGCCT
GGCGCCGCAT CTGCGGACCT CGCCATAGTG CTCTTCGACA TTCACCTCGC CGCGGAGAGC
GTGATCGTAC TCAGCCTTCA GGCCATGCCG CCTGCGGCCC TCGTCGAGGC GGTGATGGCG
CGCGATGCGA AGACCATCGA AGAACGGATG CTGTCCCTTG ACGGAGAGAA CGTCAAACAG
GTCGAGTCGG CCAAAGCTTT GCTCTGGCTT CATTCGGTCC GCGAACGACT GCATTCGAGC
CTGGCCGGCA TCAGGGAGGA GGACCTCGAG GAACTGCCTT CGGCTCAACC ACCAAAGCTG
TCAGCCACTC GCCTTTCCAT CGCGAACCCG GCTTTACGCA ATGCAATCCA GATCACGCTT
GCCTCGGGGA TCGCGATGGT GTTTGGCCTG ATGCTTTCGC GCGAGCGCTG GTTCTGGGCA
GTCCTCGCCG CCTTCCTCGT CTTCACCAAT ACGCGCTCGC GCGGAGACAC CGTAGTCAAG
GCCCTGCAGC GGTCGGCCGG CACGCTTGCG GGCATCATCG TCGGACTTGC TGCGGCAAGC
GCCATCGGCG GGAACATCTA TGTCGTCCTG CCATTGGGTG CCGCCTGCAT TTTCCTGGCA
TTTTATTTTC TCCCGGTTTC CTACGCGACC ATGACCTTCT TCGTCTCGGT CGTACTATCT
CTCGCCTACA GCCTCCTGGG CGTTTTGACG CCGCAACTTC TGGAGTTGCG CCTTGAGGAA
ACGCTGATCG GCTCCGTGGC CGGTGCTGCT GTGGCTTTCG TGGTGTTCCC GACGAGTACC
CGCACGACGC TCGATGCCGC AATCAGGAAT TGGTGCGACA GGCTGGTCGA CCTGCTCGAG
GAGGCAAAGA AGGGAACGAC CGGCCTGAAT TTGGTCAGCC GGTCGCAGGC GCTCGATCGT
GCCTACAGGG ATCTCACCGC GGCCGCCAAG CCGCTCGGCG TTTCCTGGCA GCTCGTCACC
CGACCGGGGC ACGTGCGCCA GACGCTGGCC GTGTTCATGG GATGCACCTA TTGGGCGCGA
ATCGTCGCGC GGAAGATGTC GCAGACAATG AAAGACCCCG CGGCCTTCAC GGCTCGGATA
GAAGAAAATC TAAAGCTCGC CGGCAAGGTG CGCGAAAACG CAGCTGGCTA TTTCTACCAG
TCCCACAGCG TCGCCGGTCC GATAGAACGT CACCTGCCAG TGTCGCGCGA CGATGCCGGC
CTTGGGCTGG AGATGGTCGC CGTTTCGCTC GGGCGCCTCC ATTTTCCGCC GCCGCCCGTC
GGCGAGGAGC GCGGTCTTTA A
 
Protein sequence
MTFRLGLRDW LLANDPALSR LRQASRITAT VVFSTALLVL FHFIVTSLPP AAYGLAITLS 
IEGGLAVRDR TASEQLVTRI LAVVTAVAMV TLASALEDHR HISDFVFLVI IFVAVYGRAF
GQRWFAVGMF AFMSYFTGAY LRPSLDQLPA LVLGAGISAA VAHLVRTVLL PDDRYRDLLR
AIASVQQRVD DILLGIVAAA RKSRIADRRR LHALEERLKE SVLMAESFIP MDSSRPAPEP
GAASADLAIV LFDIHLAAES VIVLSLQAMP PAALVEAVMA RDAKTIEERM LSLDGENVKQ
VESAKALLWL HSVRERLHSS LAGIREEDLE ELPSAQPPKL SATRLSIANP ALRNAIQITL
ASGIAMVFGL MLSRERWFWA VLAAFLVFTN TRSRGDTVVK ALQRSAGTLA GIIVGLAAAS
AIGGNIYVVL PLGAACIFLA FYFLPVSYAT MTFFVSVVLS LAYSLLGVLT PQLLELRLEE
TLIGSVAGAA VAFVVFPTST RTTLDAAIRN WCDRLVDLLE EAKKGTTGLN LVSRSQALDR
AYRDLTAAAK PLGVSWQLVT RPGHVRQTLA VFMGCTYWAR IVARKMSQTM KDPAAFTARI
EENLKLAGKV RENAAGYFYQ SHSVAGPIER HLPVSRDDAG LGLEMVAVSL GRLHFPPPPV
GEERGL