Gene Smed_1451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1451 
Symbol 
ID5322305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1530377 
End bp1531678 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content64% 
IMG OID640790395 
Productpeptidase M15A 
Protein accessionYP_001327131 
Protein GI150396664 
COG category[S] Function unknown 
COG ID[COG3108] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.556547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00120403 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGCAACATC TGGAAACAGG ACGCGGTCTC CTGACAATCG GACGCGCCGC GGCATTATCC 
GTATCGCTGC TGGCACTCGC CGGATGCGTT TCCGCCGTTG CGGATGGCGA GACGATCGAC
CCCCTCAAAT CGCAGCAGAC GGCGGAAGCA TCATTGGCAG GCTCCCAGGA AATATCGCCG
GGTAAGCAAG AGGCGGTGCC TGACGACAGG ACCGCGCAGG CGCCCGACGG CGCGACGGCG
GATGCCGGAC AGAGCGCAGC CGTGCAGCCC GGCCTTACCA TGCAGGGAAC TGCGCTACGC
GCCACATCAT CGAGTATTTA TGGAGAATCG CCGGCGGCGA CATCCGCAGC CGTGCAGCCG
GACCCAAGCA ACCAGCCGAC GCCAGCCGCC GCAGCACCGA GAATGAATGC CAGGACCAAC
AGCCTGTTCA GCAACGGGCA ATCCGAGGCC CAAACGGCGA ATCAGCCGCC GCAGGGGGCC
TCGGCAGGGC AAACTCCTGC GGCAAACGAA ACGATTGCGG CTACAGGTCC GACTGCCGCA
GTCGATATGC CCGTGTCGGT ACCGTTGCCT TTGAGCGCGC AGGCAGCACT GTCAGGAGCG
ACCGCGTCGG CCCTGCAACC GGTCGAAGTC GCTTCCGCAG CGGCGGTGAG CACGCCTTCC
GCGGGCCCCG GCGAAGGCGA GAAGGAGACA AAAGGGGCGA AGAAGACCTG GACCCTGGCG
AGCCTGTTCG CGCCCAAACG CAAGGAAATG CCGCGCGAAA CACACGCTGC GCAAGCAAGC
CGGAAGAAGA CGATAACCGT GAGTAATGCG GGTCAGCCTC AGATCGCATC TCTCGCCTAT
GCTTCCCTGC CGGGCGTCAA TATGAATCCG CTCTTCAGCG TGGAGCACGA TGCGCATGCT
GCCGACGAGG ACGACGCGCC CTTGGAAGTG GCCAACCTCT CCGGCCTCGC CCGACTCACA
CCGAACGGCC TCATACTGCA AACCGAGAAG GTGGAAACAG GCTGCTTCAA ACCGGAGCTT
CTCAACATTT TGAGAACGGT GGAAGCGCAT TACGGCCGCA AGGTCATGGT CACCTCGGGC
CTGCGCGCCA TCAAGGTCAA CCGCAAGCGC CAGTCCCGAC ACACACGATG CGAGGCGGCC
GACATTCAGG TGGCGGGCGT CAGCAAATGG GAGCTTGCAG ATTTCCTGCG CAAAGTTCCG
GGCCGCGGCG GCGTCGGCAC CTATTGTCAC ACCGAATCCG TACACATCGA CATCGGCCCG
CAACGGGACT GGAACTGGCG GTGCCGCCGC CGCAAGGGTT GA
 
Protein sequence
MQHLETGRGL LTIGRAAALS VSLLALAGCV SAVADGETID PLKSQQTAEA SLAGSQEISP 
GKQEAVPDDR TAQAPDGATA DAGQSAAVQP GLTMQGTALR ATSSSIYGES PAATSAAVQP
DPSNQPTPAA AAPRMNARTN SLFSNGQSEA QTANQPPQGA SAGQTPAANE TIAATGPTAA
VDMPVSVPLP LSAQAALSGA TASALQPVEV ASAAAVSTPS AGPGEGEKET KGAKKTWTLA
SLFAPKRKEM PRETHAAQAS RKKTITVSNA GQPQIASLAY ASLPGVNMNP LFSVEHDAHA
ADEDDAPLEV ANLSGLARLT PNGLILQTEK VETGCFKPEL LNILRTVEAH YGRKVMVTSG
LRAIKVNRKR QSRHTRCEAA DIQVAGVSKW ELADFLRKVP GRGGVGTYCH TESVHIDIGP
QRDWNWRCRR RKG