Gene Smed_1397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1397 
Symbol 
ID5322248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1472542 
End bp1474362 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content59% 
IMG OID640790339 
Productextracellular solute-binding protein 
Protein accessionYP_001327078 
Protein GI150396611 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.982775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.886089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTTGC GCGCAGTTCT TTCCGGCCTG GTGATGCTCG TGGCAGCCAG CATATTTGCC 
AACGGTGCTT TGGCCGCGCC CGTGCATGCG ATCGCCATGT ATGGCGAACC AGCCTTGCCA
GCCGACTTCA AGCACTTCCC TTACGTCAAC CCGGAGGTGA AGAAAGGCGG GAGGATTGCC
TATGGCGTCG TCGGCACGTT CGACAACCTC AACCCCTTCA TCCTGAAGAG CATGCGCACG
ACTGCGCGCG GCATGTGGGA TCCCGGTTTC GGCAATCTGG TCCACGAGTC CCTGATGCAG
CGCTCGCAGG ACGAGCCCTT CACCATGTAT GGCCTGCTCG CCGAGACCGT CGAATGGGAC
GATGACCGTA CTTTCATCCA GTTCAACCTC AATCCCAAGG CGCGCTGGGC CGACGGTCAG
CCGGTAACCG CAGAAGACGT GATCTTCACC TTCGAATTGA TGCGCGACAA AGGTCGCGCA
CCCTTCAGCA ACCGCCTCTC CAAGGTGGCG AAGATGGAAA AGGTCGGGGA GCGAAGCGTG
CGCTTCACTT TCACCGAGGA TGCCGACCGA GAGGTTCCGC TTTTGCTTGG GCTTTCACCC
GTGCTGCCGA AACATGCGGT CGACGTCGAG GCGTTCGATC GAACGAGCCT CAAACCGCCG
CTCGGCTCCG GCCCTTACCG CGTCGCGGAA GTCAGGCCCG GAGAACGAAT CGTCTATCGC
CGCAATCCCG ATTATTGGGC CAAGGATCTG CCGTCCAAGG TCGGTCTCGA CAATTTCGAC
GAGATCTCGG TTGAGTACTT TCTTCAGGAG AACACGCTTT TCGAAGCCTT CAAGAAGGGC
GTCGTAGACA TCTATCCGGA AGGAAGCGCC ACCAAATGGG CTCGCGCCTA TGATTTCCCC
GCGGTCCGCA GCGGCGACGT GATCAAGGAA ACATTCAAGC CGAAGACGCC CTCCGGAATG
CTCGGCTTTG TCTTCAACAC GCGCAGGCCT GTATTCGACA ACATCAAGCT CAGACAGGGC
CTGGCTCTCG TCTTCGACTT CGAGTGGGTC AACAAGAACC TCTTCGACGG TGCCTATACC
CGCACGCAAA GCTACTGGCA GAATTCGTCG CTTTCCTTCC TGGGCGTAGC AGCCGACAAT
CGTGAACTTG ACCTCATGGG AGATGTCAGG GAGCGCATCA ATCCGGCGAT CCTCGACGGC
AGCTATAGGC TCCCCGTTAC GGACGGCTCG GGTCGCGATC GCAACGTGCT TCGGGAGGCA
GTGACGCTGC TGCGTGAAGC GGGCTACTCG ATCAAGGACG GCAAGATGGT CGATGGCAAC
GGCACGCCGC TTGCTTTCGA GATCATGAGC CAGAATGCCG GACAGGAGAA GATCGCCCTC
GCCTACCAGC GCTTTCTGGC CCCGCTCGGC ATTGTTGCAA GGGTTCGGAC GGTCGATGAT
TCGCAATATC AGTTGCGCAG CCAGTCCTTC GATTACGACG TCATCATAAA GTCGTTTCCG
TCGTCGCTCT CCCCGGGTCT CGAACAGATC AACCGCTGGA GCTCGCGAAC ACGGGACAGG
CAAGGTAGCG ACAATTTCGC CGGGGTCGCC GACAAGGACG TCGACAAACT GATCAACAAC
ATACTCCAGG CACGAAACCC CGATGATTTC ACCGCCGCCG TTCGCGCGCA CGACCGGCTG
CTGGTCAACA ATTCCTATGT GGTTCCGCTT TACCACCTCG ATGCGCAATG GATCGCCCGG
TGGAAGCGTA TCGGCCGGCC AACGGCTGAG CCGCTCTATG GCTATCAGTT GCCGAGCTGG
TGGGACGAGC GGGTCCAGTA A
 
Protein sequence
MKLRAVLSGL VMLVAASIFA NGALAAPVHA IAMYGEPALP ADFKHFPYVN PEVKKGGRIA 
YGVVGTFDNL NPFILKSMRT TARGMWDPGF GNLVHESLMQ RSQDEPFTMY GLLAETVEWD
DDRTFIQFNL NPKARWADGQ PVTAEDVIFT FELMRDKGRA PFSNRLSKVA KMEKVGERSV
RFTFTEDADR EVPLLLGLSP VLPKHAVDVE AFDRTSLKPP LGSGPYRVAE VRPGERIVYR
RNPDYWAKDL PSKVGLDNFD EISVEYFLQE NTLFEAFKKG VVDIYPEGSA TKWARAYDFP
AVRSGDVIKE TFKPKTPSGM LGFVFNTRRP VFDNIKLRQG LALVFDFEWV NKNLFDGAYT
RTQSYWQNSS LSFLGVAADN RELDLMGDVR ERINPAILDG SYRLPVTDGS GRDRNVLREA
VTLLREAGYS IKDGKMVDGN GTPLAFEIMS QNAGQEKIAL AYQRFLAPLG IVARVRTVDD
SQYQLRSQSF DYDVIIKSFP SSLSPGLEQI NRWSSRTRDR QGSDNFAGVA DKDVDKLINN
ILQARNPDDF TAAVRAHDRL LVNNSYVVPL YHLDAQWIAR WKRIGRPTAE PLYGYQLPSW
WDERVQ