Gene Smed_3891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3891 
Symbol 
ID5318685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp349107 
End bp350192 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content64% 
IMG OID640775703 
Producthypothetical protein 
Protein accessionYP_001312636 
Protein GI150376040 
COG category[S] Function unknown 
COG ID[COG4641] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.172461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTGG CATTTTACGG ATCGAGCCTC GTTTCGGCCT ACTGGAACGG CGCGGCCACC 
TACTATCGCG GCCTGCTCCG TGCTCTCGCG CAACGGGGCT ATCAGATCAC CTTCTATGAG
CCTGATGTCT ACGACCGGCA AATGCACCGC GACATCGATC CGCCCTTCTG GTGCAAGGTC
GTCGTCTACG AAGGGACCAT TGAAGGTCTG AAAAGCGTCG CCGGGAAAGC CGGTGAGGCC
GACATCGTCG TCAAGGCGAG CGGAGTGGGC TTTGAGGACG AGTTACTGCT CGCCGAGGTC
ATGGCGGCGG CGGATCCAGC GGCATTGAAG ATCTTCTGGG ACGTCGATGC ACCTGCCACG
CTGGCCGATC TCAGGGCCGC GCCCGACCAC CCGCTTCGTC GCGCGCTTCC CTCTCTGGAT
CTCGTTCTGA CCTATGGCGG CGGCGATCCC GTGGTCGGCG CCTATCGGGC TCTCGGTGCC
CGCGAATGCG TTCCGATCTA CAATGCCGTC GATCCCGAGA CGCACTATCC GGTTTCGCCG
GACCCGCGCT TTAACGCGGA TCTCGCCTTC CTCGGCAATC GCCTGCCGGA CCGGGAGGAG
CGGGTGGAAG CCTTCTTCCT CGAGCCGGCG CAAAAGCTCT GGCAGCGGCG TTTCCTGCTT
GGCGGAGCGG GTTGGCATGA CAAATCCCTG TCCCCGAACG TCGCCTATAT CGGTCATGTC
CCGACGGCGG ACCACAATGC CTTCAACACG ACACCGACTG CCGTATTGAA CATTTCGCGC
TCCAGCATGG CCGATAACGG CTTTTCGCCG GCAACCCGCG TATTCGAGGC GGCCGGCGCC
GGTGCCTGCC TGATCACCGA CTACTGGGAG GGGATCGAAC TTTTCCTGAA GCCCGGAGAA
GAAGTGCTGG TCGCCAGAGA CGGCCGCGAC GTCGCCGAGC TGATGCAGAA ACTGACCAGC
GCTAACGCCA GGGAGATCGG CGAGCGCGCG CTGCGCCGCG TGCTGGCCGA GCACACCTAT
GCGCATCGTG CCGCAGAAGT GGACCGCATC TTGCGCCAGG CGCGCCGGAT GGAGGCTGCC
GAATGA
 
Protein sequence
MKLAFYGSSL VSAYWNGAAT YYRGLLRALA QRGYQITFYE PDVYDRQMHR DIDPPFWCKV 
VVYEGTIEGL KSVAGKAGEA DIVVKASGVG FEDELLLAEV MAAADPAALK IFWDVDAPAT
LADLRAAPDH PLRRALPSLD LVLTYGGGDP VVGAYRALGA RECVPIYNAV DPETHYPVSP
DPRFNADLAF LGNRLPDREE RVEAFFLEPA QKLWQRRFLL GGAGWHDKSL SPNVAYIGHV
PTADHNAFNT TPTAVLNISR SSMADNGFSP ATRVFEAAGA GACLITDYWE GIELFLKPGE
EVLVARDGRD VAELMQKLTS ANAREIGERA LRRVLAEHTY AHRAAEVDRI LRQARRMEAA
E