Gene Smed_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4226 
Symbol 
ID5319300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp708535 
End bp710073 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content63% 
IMG OID640776031 
Producthypothetical protein 
Protein accessionYP_001312964 
Protein GI150376368 
COG category[S] Function unknown 
COG ID[COG3333] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0723732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATA TCGGTCTCCT GATCGATGGC TTCGGCCACA TCCTCAGCTG GAACCATATC 
CTGCTGATGG TCGTCGGCGT GACGCTCGGC ATCCTCGTCG GGGTGCTGCC GGGGCTCGGG
GCGCCAAACG GCGTGTCGCT GCTCCTGCCG CTCACCTTCT CCATGGACCC CATCTCGGCG
ATCATCCTCT TGTCCTGCAT GTATTGGGGC GCGCTCTTTG GCGGCTCGAC GACGTCGATC
CTCTTCAACA TTCCGGGCGA GCCCTCATCC GTCGCGACCA CATTCGACGG CTATCCAATG
GCGAAAGCCG GTCATGCGAG CCGGGCGCTC ACACTCGCCT TCGTTTCTGC CGGCCTCGGC
GCGCTTGCCG GGGTCGTCAT GATCACGCTG CTTTCGGGCT GGGTGGCGAA CTTCGCACTC
AAATTCTCCT CGCCGGAATA TTTTGCCGTT TATTTTCTCG CCTTTGCGAG CTTCATCTCA
ATGGGCGCGC AGTCGCCTTT CAAGACGCTC GTGTCGATGA TGCTCGGCTT CGCTCTCGCC
TCCGTCGGCA TGGATACGAT CTCGGGCAAT CTGAGGCTCA CCTTCGACAT TCCCGAACTG
ATCAAGGGCG TCAGCTTCCT CATCGCCGTC ATGGGACTCT TCGGTATCGG CGAACTTCTG
CTGACGACGG AAGAGGGGCT GCGCTTCGAA GGCATCAAGG CGCGGGTGCG GCTGTCCGAA
ATCGGCAGGA CGCTGATCGA GATCCCACGC TATTGGCTGA CGATCGCCCG CTCGACGATT
ATCGGCATCT GGATGGGGAT CACGCCGGCC GGCCCGACCG CCGCCTCCTT CATGAGCTAT
GGCGTTGCCC GGCGCTCGGC GCGCGACAAT TCGATGTTCG GCAAGGGCGA TCCGCGCGGC
ATCGTCGCGC CCGAGACGGC CGACCATTCC GCCGGCACTT CGGCCCTGCT GCCGATGCTG
GCGCTCGGCG TCCCGGGTTC CGCCACCGCC GCGGTGATGA TGGGCGGGTT GATGATCTGG
GGCCTGACGC CCGGTCCGAT GCTCTTCACC GATCGCCCCG ACTTCGTCTG GGGCCTGATC
GCCTCCATGT ATCTCGGCAA TGTCGTCGCT GTCTTTCTCG TGATCGCGAC GGTGCCGCTT
TACGCCTCCA TCCTGCGTGT GCCGTTCTCC ATCATCGGAC CGATCATCGT CGCGGTCATC
TTCTCAGGAG CTTACCAGGT CGCAAACTCC GTTTCGGACA TCTTCATGGT GATCGGCTTC
GGTCTTCTCG GCTACGTCTT CAAAAAGCTC GACTATCCGC TGGCGCCGCT GGTCCTCGCC
ATGGTGCTCG GTGACAAGGC AGAAGACGCC TTCCGCCAGT CGATGCTGAT GTCGGGCGGC
AGCCTGAACA TCTTCTGGTC GAATGGCCTT GTCTCCGCCC TGATGGCGGT TGCCCTTGCG
CTGCTTCTTT CGCCGCTCGC CTTCTTGCTG ATCGGCAGCG TGCGAAAACG CAAGAACGAG
GTGGTGGCGC CCGGCGGTGA CGGCAGTCCG GCAGGCTGA
 
Protein sequence
MENIGLLIDG FGHILSWNHI LLMVVGVTLG ILVGVLPGLG APNGVSLLLP LTFSMDPISA 
IILLSCMYWG ALFGGSTTSI LFNIPGEPSS VATTFDGYPM AKAGHASRAL TLAFVSAGLG
ALAGVVMITL LSGWVANFAL KFSSPEYFAV YFLAFASFIS MGAQSPFKTL VSMMLGFALA
SVGMDTISGN LRLTFDIPEL IKGVSFLIAV MGLFGIGELL LTTEEGLRFE GIKARVRLSE
IGRTLIEIPR YWLTIARSTI IGIWMGITPA GPTAASFMSY GVARRSARDN SMFGKGDPRG
IVAPETADHS AGTSALLPML ALGVPGSATA AVMMGGLMIW GLTPGPMLFT DRPDFVWGLI
ASMYLGNVVA VFLVIATVPL YASILRVPFS IIGPIIVAVI FSGAYQVANS VSDIFMVIGF
GLLGYVFKKL DYPLAPLVLA MVLGDKAEDA FRQSMLMSGG SLNIFWSNGL VSALMAVALA
LLLSPLAFLL IGSVRKRKNE VVAPGGDGSP AG