Gene Smed_5863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5863 
Symbol 
ID5320165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp824684 
End bp825760 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content60% 
IMG OID640777558 
ProductABC transporter related 
Protein accessionYP_001314490 
Protein GI150377895 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0374119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACATA TCGAACTCAA GGGCATCAAC AAGGCTTTCG GTACCCACAC CGCCCTTAAG 
GATCTCAGTT TCGAGATAGC CGACGGAGAG TTTTTTGTGC TGCTGGGCGA AACGGGCGCA
GGCAAGACGA CGACCCTCAG GCTGATTGCC GGCCTTGAAA AGCCAACGGG CGGGCAGATC
TTTATCGACG GTGAGGATGT CGCCGACTGG GGCGCGGCAG AGCGCGACGT TGCGCTCGTC
CTGCAGCAAT ATTCCCTTTA TCCGCGCTAC ACGGTTCGCG AGAACCTCGA ATTCCCGCTC
AAGGCTCGCA TCCGGCGTGT TGAGCCAGCC GAGATCAAAG AGCGGGTCGC CCGGGTGGCG
AGGACACTCC GCATCGAGCA TCTGCTTGAC CGCAAAACGG ACCGCCTATC TGGTGGCGAG
ATGCAGCGCG TTTCAATCGG CCGGGCCATC GTACGCAAGC CCCGCGTCTT TCTGATGGAC
GAGCCTCTTT CCGCTCTCGA CGCCAAACTG CGGGAGGCGC TGCGAACGGA GCTCAAGAAT
CTCCAGATGA ATCTCGGGGC GACCTTCCTC TTCGTGACCC ACGACCAGAT CGAAGCCATG
TCGATGGGCG ACAAGATCGG CGTCCTCAAC AACGGCCAAC TCGTTCAATC TGGCACACCG
CACGAGATCT ATCGCAATCC GGTCAACACT TTCGTTGCAC GCGCCGTAGG CTCGCCGCCG
ATGAACCTGT TCTCAGGAAA ACTTTCGGGA TGCGAGGCGA TCGCAGACGA AGGCTATCGC
TTACCATTCG GGGCGGCGTT CGGCCCGTCG GCTGAGGGTC AGCGCCTCAC CTTCGGGATC
CGTCCTGAAG ATCTCTTCCT TGAAAGCGGC GCACCCGCCG AAGCGCGCGT CCACGGCGTC
GAAAACCACG GCGTCGAGAA GATCGTTACG CTTCGCACAG GCAATCACTT CCTGCAGGCG
ACGGTTCCGG CTCAGATGGA CCTTGCGATC GAGGAGGCTG TTCGCTTCTC CTGGAACCCT
GATAAAGTCG TACTCTTTGA CGGCGGAAGC GGGAAGAGCC TGCGCCACGT TGGCTAA
 
Protein sequence
MAHIELKGIN KAFGTHTALK DLSFEIADGE FFVLLGETGA GKTTTLRLIA GLEKPTGGQI 
FIDGEDVADW GAAERDVALV LQQYSLYPRY TVRENLEFPL KARIRRVEPA EIKERVARVA
RTLRIEHLLD RKTDRLSGGE MQRVSIGRAI VRKPRVFLMD EPLSALDAKL REALRTELKN
LQMNLGATFL FVTHDQIEAM SMGDKIGVLN NGQLVQSGTP HEIYRNPVNT FVARAVGSPP
MNLFSGKLSG CEAIADEGYR LPFGAAFGPS AEGQRLTFGI RPEDLFLESG APAEARVHGV
ENHGVEKIVT LRTGNHFLQA TVPAQMDLAI EEAVRFSWNP DKVVLFDGGS GKSLRHVG