Gene Smed_4163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4163 
Symbol 
ID5319192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp637752 
End bp638837 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content62% 
IMG OID640775968 
ProductABC transporter related 
Protein accessionYP_001312901 
Protein GI150376305 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.308021 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.489611 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGAACG GCACCGGCAA TAAGAGCGTC GTGCTCAGCG ATATACGCAA GAGCTACGGC 
AACCTGGAGG TCATCCATGG GATCGACCTG ACGATCGAGG AAGGCGATTT CGTCGTCTTT
GTCGGCCCGT CCGGCTGCGG GAAATCGACC CTGCTGCGGA TGATCGCAGG GCTCGAGGAG
GTGACGGAGG GAGAGATCGC AATCAAGGGC AGGGACGTGA CAGATCTCGA CCCTTCCGAG
CGCGGCATAG CTATGGTCTT CCAGTCCTAC GCGCTCTACC CGCATATGAG CGTTGGCGAA
AATCTCGGCT TCGGACTCAA GATGGCTCGG ACGGATCCGG CCGATATCGA GCGCCGCGTC
GCGCAGGTCT CGGCGATCCT AAAAATCGAT CCCCTGCTTG ACCGAAGGCC GGGCCAACTT
TCGGGCGGGC AGCGCCAGCG CGTGGCGATC GGCCGGGCGA TCGTGCGCAA GCCGGACGTA
TTTCTGTTCG ACGAGCCGCT TTCCAACCTC GATGCCGAGC TGCGCGTTTC GATGCGCATC
GAGATCGCAC GGCTCCACCG CGAACTCGGC AATACCATGA TTTACGTCAC CCACGATCAG
ACGGAGGCGA TGACGCTCGC CGACCAGATC GTCGTTCTGC GTGACGGCCG GATCGAGCAG
ACCGGCAGCC CCCGCGAGGT TTACGAGGAT CCCGCCAACA TGTTCGTTGC GGGTTTCATC
GGTTCGCCGC GCATGAATTT CCTTGAGGCC GAATGGCAGG GAGACGGAAC CGTCCACATC
GGCAGCGCAG CGCTTTGGGC TCCGATCGAT GGCAGCAGTC TGGCTCCGGG GGATCGCCTG
AGGCTCGGGA TGAGGCCGGA GCACCTCACG GTCTGCGAAC CCGGCGCGGA GCGGATCGCC
GCCCAGGTCG AATTCTCGGA ATATCTCGGC GGCACGCGCT ATCTCTATTG CCAACTTGAG
GACGGTCAAA GCCTTGTCGT CGAACAACGC GAAGGGCCGG ACTGGCAGGT GGGGGAGAGG
CTATCGTTTT CCGTGCCAGA GGACAGAAGA CGGTTCTTTG CCGGGGACGG CCGACGTTTG
CGTTAG
 
Protein sequence
MANGTGNKSV VLSDIRKSYG NLEVIHGIDL TIEEGDFVVF VGPSGCGKST LLRMIAGLEE 
VTEGEIAIKG RDVTDLDPSE RGIAMVFQSY ALYPHMSVGE NLGFGLKMAR TDPADIERRV
AQVSAILKID PLLDRRPGQL SGGQRQRVAI GRAIVRKPDV FLFDEPLSNL DAELRVSMRI
EIARLHRELG NTMIYVTHDQ TEAMTLADQI VVLRDGRIEQ TGSPREVYED PANMFVAGFI
GSPRMNFLEA EWQGDGTVHI GSAALWAPID GSSLAPGDRL RLGMRPEHLT VCEPGAERIA
AQVEFSEYLG GTRYLYCQLE DGQSLVVEQR EGPDWQVGER LSFSVPEDRR RFFAGDGRRL
R