Gene Smed_0916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0916 
Symbol 
ID5321757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp987231 
End bp988253 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content61% 
IMG OID640789856 
Productputative periplasmic binding ABC transporter protein 
Protein accessionYP_001326606 
Protein GI150396139 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAA AACTATCTTC AGGGGCGGGA TCGGCCGGTG TCGGCAGGCG TCTTTTCTTG 
AAATCCGCAG CAATCGGCGG CGCTGCCGCA GCCGGCGGAC TTGCAGCTCC GGCGATCGCT
CAGGGCGCTA AACGGAAGGT GATCTTTGTC GCCCATGAGG ACATCCCCTT CTTTTCCCCG
GTTCGTGCCG GGTTCAAGGA ATTCGGCAAG CTGCGAAACT GGGACACGCA GTTTCTCGCC
CGTGGCACAC CGGCCAACGT CGCCGCAACC GTGCGGTTGC AAGTGGATGC GCTGAACTCC
AGGCCGGACG CGGTCGGCTT TACCCGCATC AATGAAACCG CTTTCGATGA GAATATCATG
CGGGCAAAGG ACGCCGGCAT TCCGATCGTG CTCTACAACG TGGCAAGCGA CGGCTACGAA
AAGCTCGAAG TGCCTTTCGT CGGTCAGGAC TTCATTCCCG CCGGCCGCGT GAACGGCCTT
CAGGCGGCCA TGTACGCGCA TCAACTGACC GGCAAGACCG AAGGCACGAT CCTGATCGAC
AATCCTTCTC CCGGCGTCAG CGCGCTGGAA GAACGGGCGA CCGGCACGGA GCAGGGGATC
GACGAATACA ACGGGAAGAA CGGCACCAAC TACAAGTACG AGACATTTAC CACCGCAAAC
TCGCAGACCG AAGCGCTGTC GCGGATCGAT GCCAAGATGC GCGCGACGCC GGACGTGGTC
GGTTTCGCCA GCACTGTTTC CGGAAACTGG TTCGCAGCAA TCTGGGCCGA AGATAACGGA
ATGACCGGCA AGTTCGCCAA TGGGGGCTTC GACCTTATCC CCGGCGTTCT GGAGGCGATC
GCGGCAGAAA CGTCCCACTG GGCGGTCGGA CAGAACCCCT ATGCTCAGGG CTGGGTCACC
TCGTCACTGC TGGATATGCA GCTTGAGGCC GGATACCAGC CATTCGATTA CGATACCGGC
GCGGAAGTCG TCGACAAATC CAACGTCGAG GCCGTGACCA AGCGCGAAGC GCGTTTCGGG
TGA
 
Protein sequence
MNKKLSSGAG SAGVGRRLFL KSAAIGGAAA AGGLAAPAIA QGAKRKVIFV AHEDIPFFSP 
VRAGFKEFGK LRNWDTQFLA RGTPANVAAT VRLQVDALNS RPDAVGFTRI NETAFDENIM
RAKDAGIPIV LYNVASDGYE KLEVPFVGQD FIPAGRVNGL QAAMYAHQLT GKTEGTILID
NPSPGVSALE ERATGTEQGI DEYNGKNGTN YKYETFTTAN SQTEALSRID AKMRATPDVV
GFASTVSGNW FAAIWAEDNG MTGKFANGGF DLIPGVLEAI AAETSHWAVG QNPYAQGWVT
SSLLDMQLEA GYQPFDYDTG AEVVDKSNVE AVTKREARFG