Gene Smed_1990 details


Gene Information

Locus tag: Smed_1990
Symbol:
ID: 5322849
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2040655
End bp: 2041884
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 65%
IMG OID: 640790928
Product: hypothetical protein
Protein accession: YP_001327659
Protein GI: 150397192
COG category: [S] Function unknown
COG ID: [COG1944] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00702] uncharacterized domain
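
The coordinate and length fields above are mutually consistent, which a short Python sketch can confirm. It assumes 1-based, inclusive coordinates and a complete, unspliced CDS ending in one stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2040655
END_BP = 2041884
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```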


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.128668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTCATCAG GCACTGAGCG AGACGGTGTG AGGCCGAAGA ACGGCGCGGG CGTCGACGGC 
ACCGCCGCGG ACCTCCATGC CTCTATCGTC TCCGCGCGGA AGGTACCCTT TTCCGCCGAT
CCGGTTGAGG GGAACCTGGA ACGCTGCCTC GCCGCGCTTC CTCCGCTCTG CCGCCGCGCG
CGCATCACGC GGCTCGGCGA TCTGACCGGC CTTGACCGGA TCGGGCTGCC GGTGATGCAG
GCTGTCCGGC CTGCCGCGCT CTCCGAAGTT ACGTCTCTCG GAAGGGGCTT CTCCAAAGCG
GAGGCGGCAG TCGGCGCGCT GATGGAATCG CTCGAACGCT ATTTTGCCGA GTCCATTCCG
GCAGATCGGA CCTTTCTCGC GACCGCCGAC CAACTCGAAG TCACCAAGGG TCTCTTTGAG
AACCTCGTGG TTCCGGAACG GCGTGGAAAA TGGCGTCAAC AGGTCATTGC CTGGATCGAA
GGGATCGATG TCCTGAGCGG CTTAGTGCAG CCGGTGCCGC TGGAACTCGT GCATACCCGT
TACAGCGATC CGCCGCCGGC CCATGACGGC GTCTTCCTGC GCACGACCAC CGGCCTTGCC
TGCCATACCA GCCCCAATGG CGCTTTCCTG CACGGATTAT GGGAATGCCT CGAACGGGAT
GCGATCGCCC GTGCCTTTGC CACGCATGGC TTCTTCGATC GGATGCGGCT TGCGCCCTTT
GGCCTGGGGG ACAGGATTGA TCGTATTCGG TCGGTTGCGA GCGCTCGCGG CATCTCCTTC
GCCCTGTGGC TCGCTCCCTC TCCGGCATCC GTTCCCGTCG TCTGGTGTCA GACGATCGAG
ACTTCGCCGG GTGAGCCGAT ACTGGCGCTG CCGACGGAAG GTTACGCCGC GGGCCCGAGC
GTTGCAGCGG CGGCTGCAAG CGCAATGCTG GAAGCACTCT CGGCACGGGC AGGGGCGATC
TCTGGCGCCC GCGACGACCA GACGAGGGAG CACTATCGCA GGAGGACGGA CGGGGCGATA
GCGAAGGCCC GGGAGCTTAT TCTTGGCGAT CACGCTACAA GGTTCATGGA GACACCGACG
CTGACGCTCA CAAATTCCGG TGCGCTGGCA GGCCGCGTGA TCGATGCAGG GCTCGGACCG
GTGCTGGCCA TTTCCGTGGG TGCCGAAGGC GGTGTACATT GCGTGCGAAC CGTTCTTCCT
GGTGCCTCTC CCTTCTTCGT CTTGCGGTGA
 
Protein sequence
MSSGTERDGV RPKNGAGVDG TAADLHASIV SARKVPFSAD PVEGNLERCL AALPPLCRRA 
RITRLGDLTG LDRIGLPVMQ AVRPAALSEV TSLGRGFSKA EAAVGALMES LERYFAESIP
ADRTFLATAD QLEVTKGLFE NLVVPERRGK WRQQVIAWIE GIDVLSGLVQ PVPLELVHTR
YSDPPPAHDG VFLRTTTGLA CHTSPNGAFL HGLWECLERD AIARAFATHG FFDRMRLAPF
GLGDRIDRIR SVASARGISF ALWLAPSPAS VPVVWCQTIE TSPGEPILAL PTEGYAAGPS
VAAAAASAML EALSARAGAI SGARDDQTRE HYRRRTDGAI AKARELILGD HATRFMETPT
LTLTNSGALA GRVIDAGLGP VLAISVGAEG GVHCVRTVLP GASPFFVLR
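
The gene and protein sequences in this record can be cross-checked with the following minimal Python sketch. It strips the 10-base grouping, computes the GC fraction, and translates the CDS using the codon assignments of the standard code, which translation table 11 shares. The start-codon set is the one assumed here for table 11, and all helper names are hypothetical. An alternative start codon such as the leading TTG of this gene is read as methionine at the initiation position.

```python
from itertools import product

# Codon assignments in TCAG order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate an in-frame nucleotide sequence, dropping a trailing stop."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    protein = [CODON_TABLE[c] for c in codons]
    # A recognized start codon in first position is translated as Met.
    if codons and codons[0] in START_CODONS:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed in this record.
    head = "TTGTCATCAG GCACTGAGCG AGACGGTGTG"
    print(translate_cds(head))  # MSSGTERDGV
```

Passing the full gene sequence block to `translate_cds` and comparing the result with the listed protein sequence (after applying `clean` to it) gives a quick consistency check. `gc_fraction` can be compared against the reported GC content in the same way.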