Gene Smed_5551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5551 
Symbol 
ID5319853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp516499 
End bp517824 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content60% 
IMG OID640777300 
Producthypothetical protein 
Protein accessionYP_001314232 
Protein GI150377637 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0620848 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCCAA GTCACCGCAC GCGGGGTGCA CCCACCAACT GTGCTGAAAA TACTTCGAAA 
GGTTTACCTG CGGACGATAC GGCCAGAATT GACGGAAAAA TGAACTGGGG CGGTCTACAA
AGTCATCAGA CTATCCGCTG GGCGGTCGCC TACGGCTTGT TCGGCGCGCC CCGCTCAGGA
GCGTCGATAG CCTTCGCCTT GGCAGCACTC TCGATCACGG GCAACCCCGC CAGTGGGGCA
ACGGCGCTCG CGGCCATGAC AGCCGCGCAA CTTGTCGGAG CCTTGCCAAT TGCCCGGTTG
GGGCGGGGTG CGAACTCAAT CGCATTCTTC AAGGCTCTCA TCTTTATTCG GACGGTAGCA
CTGTTCGCTT GCGCTACTGC ATGTGGAGTG GGTGCGCCGT ACGCGGTTTT CATCGCTTCG
GCAGTGGCCG CTGGCCTCGT GCAAGGTGCG GCCTTCGGCT ATCTGCGTGA CAGCGCAAAC
TATCTCGTTG TGCAGTCACA GATGACAAGG GCCCTTGCCT TGGCAGCTTT TGCATCGGAC
CTCACCTTCC TCGTCACCCC GATTCTGGCA GCCAGTCTTG GATCGGTGTC CGCATCCTTC
TCGATAGCTG TCATCGCGAT ACTCGGCGCG GTGCCTGCAA TGATCTTGCC GTCCGCAAGA
GGCAATGTGG AGCCGCAAGC GGTGGAAACC CGAACAAGAT CGCTGACGCC TGAAGTCCTT
CTCTGGCTCG GATGCGCCTG TGCCAGTTCC GCCGCGATCA GCGGCATCGA AGTTGGCGCG
GTCTCGCTTG CGATGGGGTT CGACCTGCAA GCAGGGTACG GCGCGATATT CACTGGAACC
CTGTGTGCGG CGTCGCTTGT CGGATCCGTG GGAAACGGCC TGCTCAACAA AGCCTACTCG
AAGCAGCATG TCGCGGCGAT GTTCCTCTCG ATCATTATCG GCATGGCGCT GATACTTCAG
AATTCCTTCG GGTTATCGAT ACTCGGCTGC GCGCTTGTCG GGATATGCGC CCCCTCGCTC
GGCATCCACT ACTCATTGCA GCTCAACAGA CTGGTGACGC CAGAAATGAG GGCTGAGGTT
TTCTCCGTGC TGAAAATTTC GACTTCATTG GGAACTATCC TAGCCTCGGC GACGCTCGGC
TGGACTTCTG TCACGTTTGC CCTGACCGCT TCGATATGGA TACTTGCTTG CGCGCTGGCC
GCAATTCTCT CGAAGGACTT CATCATTGGG CGGCGGTTGT CCGAGACGCG CCTCAGTGGC
GGCGCCGAAC TAGCGACGGC CCCTGCCCTG ACCGCGGGGC GACGAAGTGA ACGGAATGAC
CGTTGA
 
Protein sequence
MFPSHRTRGA PTNCAENTSK GLPADDTARI DGKMNWGGLQ SHQTIRWAVA YGLFGAPRSG 
ASIAFALAAL SITGNPASGA TALAAMTAAQ LVGALPIARL GRGANSIAFF KALIFIRTVA
LFACATACGV GAPYAVFIAS AVAAGLVQGA AFGYLRDSAN YLVVQSQMTR ALALAAFASD
LTFLVTPILA ASLGSVSASF SIAVIAILGA VPAMILPSAR GNVEPQAVET RTRSLTPEVL
LWLGCACASS AAISGIEVGA VSLAMGFDLQ AGYGAIFTGT LCAASLVGSV GNGLLNKAYS
KQHVAAMFLS IIIGMALILQ NSFGLSILGC ALVGICAPSL GIHYSLQLNR LVTPEMRAEV
FSVLKISTSL GTILASATLG WTSVTFALTA SIWILACALA AILSKDFIIG RRLSETRLSG
GAELATAPAL TAGRRSERND R