Gene Smed_5441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5441 
Symbol 
ID5319743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp404004 
End bp406565 
Gene Length2562 bp 
Protein Length853 aa 
Translation table11 
GC content61% 
IMG OID640777204 
Producthypothetical protein 
Protein accessionYP_001314136 
Protein GI150377541 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.176021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTTT CCCCTTTTGA GACCCGCTCA CCGACAACGG GCGCAGAGCG CCCAATCACC 
CCGGACCTCC TCGCCTCAAC ATTCCTCGCT CCCCGCGCTG CATCCCATGA ACTCGCGGAC
ATCTTGCAGG ATTTTGACGC TCCATGGAGC AACAGCTATG CCGTGGATCC CGACGAGGTT
AGCCAGCACA TCGCCGCTGC GCTGAGACAG CTGGCTCTCG CGACATCCGG GCTGTCCGTC
GAAGCCATCA ATTTATCCAA TCTTCCCGAC GGTCGTGCGC GCCAACATCT CGCTGCCCTG
AAATCACTCT GGGAGCGGAT GGGCGATGCA CTGCCAGAGG ACCTCCATAT AGCGCGGCAC
GTTCTTGGCT GTTCAATTGA CGATGCTTTG GAGTCGCTGC CGATCATTGG CGGCCCCTGC
CCCTTTGAGT CACGCGCCGA ATCGGCGTTG CGCGAGCACC TCTCAATGCT TTTCGGCACG
GTTCCCGGGC CAGCGAACCC CACGCTAGCA GATGGGGCGT TGGGCCATGT CCAACAACAC
CTCCTCGCCA CGAGTCCCCC GGTCGCGCCT GACGATACCC TGCAGGTCTT CGGTTTGCGA
GATCCGCGCG AGGAAATCGC CTTCGCAGCG GCCCGGGCAC AGGCACTGAT CAGGCAGGGC
TATCGAGCAC AGGACATCGG CCTGCTCGTG CCCGATGACC CGGCGTACCA GCTGGCAATC
GAGCCGGCCT TTACCCGCGT CGGGCTTCAG CTTTCCGGTA TGGCAGCCCC GCTATCCTTG
CGTGACCATG CCGGTGAGTT TCTAACCAAT CTTCTCGCAA TTATGCGCGG CCCCGCTCCG
CGCATGGCAC TAGCAAGCAT TTGCATCTCC CCGCTGACAC AGTGGCCCGC TGACGCTGGC
CGCGATTTTG CGAGCGAGTA CATCGAGAAC GGCTGGTCCA GGGCGGCGCG CGCGTACAAT
GGACTGGGCG CAAAGATCTT CGACGAGTTA CGGCCCGTTT CCACACCGGG GCAACTCATC
GCCAGACTCT GCTCGATCGC AAACGAACTG TTTGATCCCG CGCAATTGAT GCCACGGATC
AACGCGCTGC GGCCGCTCCT TAGAGGCGAA TACATCGATT GGAGCTCTCT CGCGCGAGTC
GTCGCTCCTG CCGCTCTCAC TCCTGACGAG GAGGGTCGGT TCGTCGAAGG CGTTTCGGTT
TTCAGCGAAA CGGCCCTGCC ATGGCGCTCG GTCCGACACC TCATCGTTGC GGGAGCTGCC
GGAACCTATT GGCCTCGCCC GGTCGCCGCA AACCCATTTT TCACGGAAAG CGAAATCGTG
ATGATTGAGG GGGCGACTAA CCTCAAGCTC CCGTCGCGCC GTCAGACTCT TGCCCGGCGT
CTCGAACTGT TTCGCCGCCA GCTTGGTGTA GCTACCGACG GCATCACACT TACCGCGTCG
GCCCGTGATC TTGAAGGCAA GCAACTTGCC CCGACAACCG GGCTCTCGCT GATAGCACGC
GCGCTCGGGG CGAACGACGC TGAAGCCCTC ATTGCCGACG CAGTCGACCA CGCTCAGTCT
TCGCAAACTC ACGGCGGCGC AATTACGTCC GATGCCCACT GGGACGAAGA AAACACGCGC
CCCGTTTTGC CCGAAGGGTC CGAAATCGGC ATTCCGGGCG ATCCTTTCGC TTTAGGCAAG
GCTGCGGACG GTGCGACCAA GCCGCAGTCG CCCTCTCGGC TAGAGACGAT GCTGGTTTCG
CCGCTGGCAT GGCTCCTCGG CGAAATTGGT GCGGAGGACC GCACATGGGC TCCGGAGTTA
CTTGACGTCT TAACGCTGGG CAAACTCGTC CACAGCACTT TGGAAACGCT CTTCCCCGAA
GGCGCTAGTG GCTTAGACGA AACGGCAATC CGCGTGGCAT TTCCAGCCGC CTTCGAGACC
GCGATCACCC AATCGGCTCC CTGGCTCGTT GGTGACAATT GGGTTAGCGA GCGTACCAAT
CTTGCCCGAG AGACGCTCGA GGCAGCACTT AATTGGGGTC GCTTCCTGAT TGACAACGGT
GCCCGGGTCC GCCGTGTCGA AACCCTGCTT TCCGGCGCCT ATCATCACCT TTCCATCAAC
GGTCGCGCAG ATTGCCTGCT TGAACTGGCG GATGACCGCA TAATGGTCGT GGATCACAAG
CGCTCGAGCT CTTCCTCACG CCGCAAGCGA ATGGAAGCGG CGGCAGATCT CCAGGTCGAG
CTCTACCGTC GCATGATCCC AACAACACCA GAGTTTGCCC AGGTCACCTC AGATCGCATC
GTCACTGCTT ACCACTGCAC GCTTGATGGT CGGGTCGTCA CCGGACCTGA AGGCAAAGGA
CTGAAGGGCG TAGTCACCGT CAACGGTCAA ATCGGATCGA CGGCCAACGA AATCGTGTCT
GGGCACATCA AGACTCTTTC GAAGGGGCGG CTTGCACTCA ACCTCGTGTC GGATAGCCAG
AAGTTTGAAA CAGGTCTCGG CATCAAACCC TATGCGCTCG ACAATCCGCT TGTAGCCGCT
TTCCTTCTCC CCGACCCAAG CCAGGAGGAC GCTAATGTCT GA
 
Protein sequence
MRLSPFETRS PTTGAERPIT PDLLASTFLA PRAASHELAD ILQDFDAPWS NSYAVDPDEV 
SQHIAAALRQ LALATSGLSV EAINLSNLPD GRARQHLAAL KSLWERMGDA LPEDLHIARH
VLGCSIDDAL ESLPIIGGPC PFESRAESAL REHLSMLFGT VPGPANPTLA DGALGHVQQH
LLATSPPVAP DDTLQVFGLR DPREEIAFAA ARAQALIRQG YRAQDIGLLV PDDPAYQLAI
EPAFTRVGLQ LSGMAAPLSL RDHAGEFLTN LLAIMRGPAP RMALASICIS PLTQWPADAG
RDFASEYIEN GWSRAARAYN GLGAKIFDEL RPVSTPGQLI ARLCSIANEL FDPAQLMPRI
NALRPLLRGE YIDWSSLARV VAPAALTPDE EGRFVEGVSV FSETALPWRS VRHLIVAGAA
GTYWPRPVAA NPFFTESEIV MIEGATNLKL PSRRQTLARR LELFRRQLGV ATDGITLTAS
ARDLEGKQLA PTTGLSLIAR ALGANDAEAL IADAVDHAQS SQTHGGAITS DAHWDEENTR
PVLPEGSEIG IPGDPFALGK AADGATKPQS PSRLETMLVS PLAWLLGEIG AEDRTWAPEL
LDVLTLGKLV HSTLETLFPE GASGLDETAI RVAFPAAFET AITQSAPWLV GDNWVSERTN
LARETLEAAL NWGRFLIDNG ARVRRVETLL SGAYHHLSIN GRADCLLELA DDRIMVVDHK
RSSSSSRRKR MEAAADLQVE LYRRMIPTTP EFAQVTSDRI VTAYHCTLDG RVVTGPEGKG
LKGVVTVNGQ IGSTANEIVS GHIKTLSKGR LALNLVSDSQ KFETGLGIKP YALDNPLVAA
FLLPDPSQED ANV