Gene Smed_1129 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1129 
Symbol 
ID5321975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1196783 
End bp1197784 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content59% 
IMG OID640790070 
Productaliphatic sulfonate ABC transporter periplasmic ligand-binding protein 
Protein accessionYP_001326815 
Protein GI150396348 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAGGA ATACCGTGAC TTTTACCCGA CGCAGTTTTC TTGGTGCTGC CGCTGCCGGA 
GCGCTCGCAA CCCCGATGCT GGGTGTCGCC ACGCGACGCG GTTACGCTCA GGTCAAGCCC
GTAAGAATAG GTTACATCGC CGACTATTTC GGCACGAGCC TAACGGCCAT CGCCACCGAC
CAGAACTTGT GGGCCAAACA TGGCCTCGAG CCCGACTTGA AGGTGTTCAC CAACGGCCCG
ATCCAGATCC AGGCGTTGGG CGCGGGCAGC CTCGATTTCG GCTATGTCGG TCCCGGTGCC
CTTTGGCTTC CGGCCATGGG CAAGGCCAAA CTCGTCGCGA TCAATGCGCT CGGCCTGTCC
GACCGTGTGA TTGCGCAGAA AGGCATCAAC TCGGTGGCGG ATCTGAAGGG GAAAAAGGTC
GGCGTTCCGG AGGGCACGTC AGGTGACATG CTTTTGCGTC TAGGGCTCGC CAAGGCCGGC
ATGTCGATTT CCGACATAGA AGTGGTCAAA ATGGACCCAT CGACCGTTGT TGCGGCCTTT
GCGTCGAAGC AGATCGATGG CGCCGGCATC TGGTATCCCC TGGTTGGCAT TATCAGAAAG
ACCGTCCCGG ATCTGGTTGA AGTCGCCAAG AGCGACGAAT TCTATCCGGA AAATTCGTTC
CCGTCGGCCT TCGTCGCCCG AAATGAGGTG ATCACCGACG ACATCGACAT GGCCGGAAAG
TTCGTCGCCA CGATGAAGGA GGCCAACGAC TATCGCGCTG CCGACGTGCC GCGCTCGGTC
GAAATCACCG CCAAATTCCT CGGCGTGCCG ATAGAACCGC TTCAAGTCGA AAGCGAGAAC
GGCAAGTTTT TGACTTCGGA GGAGCTTGCC GCCGCCAGCA AGGATGGCAC GGTGGCCGGC
TGGCTGAAGG GACTCAATGA CCAGTTCGTT GCTTTCGGCA AGATGCAGGA CCCGCTCGAC
CCGGAGGATT ATTATCTCGC CGACCTTTAC GCGGGCAATT AG
 
Protein sequence
MRRNTVTFTR RSFLGAAAAG ALATPMLGVA TRRGYAQVKP VRIGYIADYF GTSLTAIATD 
QNLWAKHGLE PDLKVFTNGP IQIQALGAGS LDFGYVGPGA LWLPAMGKAK LVAINALGLS
DRVIAQKGIN SVADLKGKKV GVPEGTSGDM LLRLGLAKAG MSISDIEVVK MDPSTVVAAF
ASKQIDGAGI WYPLVGIIRK TVPDLVEVAK SDEFYPENSF PSAFVARNEV ITDDIDMAGK
FVATMKEAND YRAADVPRSV EITAKFLGVP IEPLQVESEN GKFLTSEELA AASKDGTVAG
WLKGLNDQFV AFGKMQDPLD PEDYYLADLY AGN