Gene Smed_2066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2066 
Symbol 
ID5322925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2117962 
End bp2119011 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content59% 
IMG OID640791003 
ProductABC transporter related 
Protein accessionYP_001327734 
Protein GI150397267 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.626309 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0289677 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCGG TCGAAATTCA GAGTGTCAAA AAGTTCTACG GCGCATTGCA GGCGTTGCAC 
GGGGTTTCAA TCCAGATCGA AGACGGAGAG TTCGTCACGC TGGTCGGCCC TTCCGGTTGC
GGAAAATCAA CCCTTCTGAG GATGCTCGCC GGGCTCGAGG AAATCAGCAG CGGGACGATC
CGCATCGGCG CGGCGGTCGT CAACGACCTC CCGCCGAAGG ACCGCGACAT TGCAATGGTC
TTTCAGAACT ATGCCCTCTA TCCTCACATG ACGGTCGCCG AGAACATGGG CTTTGCCTTG
AAGCTCAAGA ATGCCGACAA AGGCGAGATC CGCTCCAAGG TCGAGCGTGC GGCGAATATC
CTCAATCTCG ACAAGCTGCT CGATCGTTAT CCGCGCCAAC TCTCGGGAGG GCAGCGCCAG
CGGGTCGCCA TGGGCCGGGC GATCGTGCGC GCGCCCAAGG TCTTTCTTTT CGATGAGCCG
CTTTCCAATC TCGACGCGAC GTTGCGGGTC TCGATGCGCG CCGAGATCAA GAGCCTGCAT
CAAAGGCTCG GCACGACCAT TGTCTATGTG ACCCATGATC AGGTCGAAGC TATGACGATG
GCCGACAAGA TCGTGGTGAT GCGCGATGGG ATCGTAGAGC AAGTCGGGGC GCCGCTGGAG
CTTTATGACA GGCCATCCAA CATGTTCGTT GCCGGCTTCA TCGGGTCGCC AGCGATGAAC
TTTCTGACGG GCGATATCCG CGCGAATGGA TTTATGACCG GCACTTGTCT GTTTCCCATT
GGTGAAAATC GGCCTGATCT GCATGGCCGG AGCGCCGTGT ACGGAATACG TCCCGAGCAT
CTGCGCATCT CGGAGGACGG CATTCCCGCC GAGGTTCAAC TGGTCGAGCC AACCGGGTCC
GAGTCACACC TGATCGTCAA AATCGCGGAT CAGGCAATCA CTTGCGTGGT GCGGGACCGC
GTGGACGTCA GACCCGGCGA TTTAATCCGG CTGTCTCCCG ACGCGGATCG CGTTCACCTG
TTCGATCCTG ATGGAGAGAA CCGGCTCTAG
 
Protein sequence
MASVEIQSVK KFYGALQALH GVSIQIEDGE FVTLVGPSGC GKSTLLRMLA GLEEISSGTI 
RIGAAVVNDL PPKDRDIAMV FQNYALYPHM TVAENMGFAL KLKNADKGEI RSKVERAANI
LNLDKLLDRY PRQLSGGQRQ RVAMGRAIVR APKVFLFDEP LSNLDATLRV SMRAEIKSLH
QRLGTTIVYV THDQVEAMTM ADKIVVMRDG IVEQVGAPLE LYDRPSNMFV AGFIGSPAMN
FLTGDIRANG FMTGTCLFPI GENRPDLHGR SAVYGIRPEH LRISEDGIPA EVQLVEPTGS
ESHLIVKIAD QAITCVVRDR VDVRPGDLIR LSPDADRVHL FDPDGENRL