Gene Smed_1028 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1028 
Symbol 
ID5321874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1098221 
End bp1099537 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content60% 
IMG OID640789971 
Producthypothetical protein 
Protein accessionYP_001326716 
Protein GI150396249 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000562968 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAATT TCATCGACCG CCGCCTCAAT CCGAAGGACA AGAGTCTCGG CAACAGGCAA 
CGTTTCCTGA AACGGGCGCG AGAGGAGCTT AAACGAACCA TCAAAGAACG GGTCAAGTCG
GGCAAGATCG CGGATGTGGA TGCGGAGCAG AACGTGTCCA TGCCGGCCCG CGGCGTCAAC
GAGCCGGCCT TCCAGCCGGA CTCCAACAGC GGCGAGCGGC GCCACGTCCT GCCGGGAAAC
CGGGAGTTCG CGGCAGGAGA CCGCATCCCG AAAAGGGGTG GAGGCGGCGG CGCCGGAAAT
GCGGGCGCCG GCACCGGCCA AAGCGAGGAC GAGTTTCAGT TCGTCCTTTC ACGCGAAGAG
GTGCTCGACC TCTTCTTCGA GGATCTCGAA CTCCCCGACA TGGTCAAGCT CAATCTGAAG
GAGTCGGTTA CGTTCAAGCG GCGACGAGCC GGCTTCAGCG CAAGCGGCTC TCCCACGAAC
ATCAATGTCG GGCGCACCAT GCGCAACAGC TATGGGCGCC GAATCGCATT GCGGCGGCCG
TCGCGCCGGG AAATCGAGGC CCTGGCCGAT GAGATTGCCA GGCTCGAAAC CGAGCCTGGC
GGGCGGAACA AGCATCGTCA GCGATTGGAG GAACTGCGAC AGACGCTCGA CAGTCTCGAG
CGACGGCGCC GGCGAATTCC CTATGTCGAT CCGGTAGACA TTCGCTTCAA TCGTTTCGAG
CCTCAGCCTT TACCGAATGC GAGCGCAGTC ATGTTCTGCC TCATGGATGT CTCGGCGTCG
ATGGGGGAGC GGGAGAAGGA CCTCGCCAAA CGTTTTTTCG TGCTGCTGCA TCTCTTCCTC
AAGCGGCGCT ACGAGCGGAT CGACATCGTA TTCATCCGGC ACACCGATGA AGCCGGCGAG
GTCGACGAGA ACACGTTTTT CTATAGCAAG CAGAGCGGCG GCACGGTCGT TTCCACCGCC
CTGGAGGAGA TGCTGCGCGT TATCAGGGAG CGTTACCCTG CCAACGAATG GAACATCTAC
GCCGCACAGG CGTCGGACGG CGAGAATATC TCAGGCGACT CCGAACGCTG CGCCTCCCTT
CTTCATGACG AGCTCATGGG ACTTTGCCAA TATTATGCCT ATGTCGAGAT CATCGATGAG
CGCGAGACGG AGATTTTCGG CACCACCGAC AACGGGACTT CGCTCTGGCG AGCCTACCGC
ATCGTCGATG GCGAATGGCC GAATTTCCAG ATGACCCGCA TCGCGAAACC GGCGGATATC
TATCCCGTCT TCCGAAAACT CTTCGGCAAG CAGCCGGAGA TGCAATTGCG CAAGTAA
 
Protein sequence
MPNFIDRRLN PKDKSLGNRQ RFLKRAREEL KRTIKERVKS GKIADVDAEQ NVSMPARGVN 
EPAFQPDSNS GERRHVLPGN REFAAGDRIP KRGGGGGAGN AGAGTGQSED EFQFVLSREE
VLDLFFEDLE LPDMVKLNLK ESVTFKRRRA GFSASGSPTN INVGRTMRNS YGRRIALRRP
SRREIEALAD EIARLETEPG GRNKHRQRLE ELRQTLDSLE RRRRRIPYVD PVDIRFNRFE
PQPLPNASAV MFCLMDVSAS MGEREKDLAK RFFVLLHLFL KRRYERIDIV FIRHTDEAGE
VDENTFFYSK QSGGTVVSTA LEEMLRVIRE RYPANEWNIY AAQASDGENI SGDSERCASL
LHDELMGLCQ YYAYVEIIDE RETEIFGTTD NGTSLWRAYR IVDGEWPNFQ MTRIAKPADI
YPVFRKLFGK QPEMQLRK