Gene Smed_5028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5028 
Symbol 
ID5318776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1549052 
End bp1550515 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content65% 
IMG OID640776810 
Producthypothetical protein 
Protein accessionYP_001313742 
Protein GI150377146 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.924524 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAGAG CCAATGTCAA ACGGGCTGCG GCCCTCGCCC TGCCACTTGT GCTCGGCGGC 
TGCGTGTCCG CGAGCGAATA TGCAGCGAAG AACGCCGGGT TCTCCGCGGT CGAGGCCAAG
ACTGCCGAGG CCGTAAGAAA GCAGACGGTC TGGATACAGA ACCAGCAGCA CGCACGTGTC
GTATCCGACC GCGTGAAGAC GCTGATGGCT AAGAAAGCGA TCGATGTCGA GACGGCGGTT
CAGGTGGCAC TCCTCAACAA CAGGGGGCTG CAAGCCGCCT ACGCCGACTT GGGGGATTCC
GCTGCCGATG CCTGGCAATC GACGATGCTC GTCAATCCGA CGGTCTCGGT CGGCCTGACC
GGGATCGGCA CGCCGGGCCT CGAAGCATTC AAGTCGGTCG AAGGCATGAT CGCCAACAAC
ATCCTGGCGC TCGCCACGCG AGAACGGGAC ATCGCAATCG CCGACACCCG TTTCAGGCAG
GCACAGTTGA ACGCGGCACT GCGCACCCTG CAGCTCGCCG CCGACACGCG GCGCGCCTGG
ATCAATGCGG TTGCCGCCTG GGAAACGGTG GCCCAGCTCA ACCAGGCGCA GGCCGCAGCC
GACGCAGCGT CCGAGCTTGC CCAGGAGCTT GGCAAGAGCG GCGCACTCAC GAAAGAGGGG
CAGGCCCGCG AGCATGTTTT CTCTGCCGAA CTGGCGGGGC AGACGGCAAA GGCGAGGCTG
GAGGCAAGGC TCGCCAAGGA AGAGCTGACG CGGCTGATGG GCCTGTGGGG TTCGGGTATC
GACTATCAGG TTCCGAACCG TCTGCCGCCG TTGCCGAAGG GAATAATGAA GCGCGACCCG
ATCGAGGCGG AAGCATTGCA GCGCCGCGTC GATCTGCAGA TGGCGAAGCT CGACCTCGAA
GCGACCGCGA GATCCTACAA GCTGACGGAA GCGACCCGCT ACGTCACGGA CCTCGAACTC
CTCACGGGAT TCGAGACCGA GCGGGAACTC GAGGACGGCG ACATCAGGAG CGAAACGACC
GGACAGGCCG AACTCGAATT CGTCATTCCG ATCTTCGACA GCGGCAGAGC CCGCATGCGC
AAGGCGGAAC TTGCCTATAT GCGGGCGGCG AACCTCCTCG CGGAGAAGGC CGTCAACGTC
CGCTCGGAAG CGCGCTCGGC CTATCAGGCC TACCGCGCCA ACTACGACAT TGCCCGGCAC
TACCGCAAGA GCGTCGTGCC GCTGCGCACC AGGATCGCGG AGGAATCCCT CCTCACATAC
AACGCGATGA TCACCAACAC CTTCGAGCTG CTCGCCGACA GCCGCGAGAA GGTCAATGCG
AACCTGCTGG CCGTCAACGC CAAGCGCGAC TTCTGGCTGG CCGAAGCCAA TCTCGCCCCC
GCCGTCTACG GCGGCGGCGC GGGTGCCGCC GCCGTCGAAA TCGAAGTCGC GGCAGCCGCC
GAGAGCGGTG GTGGCGGCCA CTGA
 
Protein sequence
MMRANVKRAA ALALPLVLGG CVSASEYAAK NAGFSAVEAK TAEAVRKQTV WIQNQQHARV 
VSDRVKTLMA KKAIDVETAV QVALLNNRGL QAAYADLGDS AADAWQSTML VNPTVSVGLT
GIGTPGLEAF KSVEGMIANN ILALATRERD IAIADTRFRQ AQLNAALRTL QLAADTRRAW
INAVAAWETV AQLNQAQAAA DAASELAQEL GKSGALTKEG QAREHVFSAE LAGQTAKARL
EARLAKEELT RLMGLWGSGI DYQVPNRLPP LPKGIMKRDP IEAEALQRRV DLQMAKLDLE
ATARSYKLTE ATRYVTDLEL LTGFETEREL EDGDIRSETT GQAELEFVIP IFDSGRARMR
KAELAYMRAA NLLAEKAVNV RSEARSAYQA YRANYDIARH YRKSVVPLRT RIAEESLLTY
NAMITNTFEL LADSREKVNA NLLAVNAKRD FWLAEANLAP AVYGGGAGAA AVEIEVAAAA
ESGGGGH