Gene Smed_4645 details

Gene Information

Locus tag: Smed_4645
Symbol:
ID: 5319290
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 1154796
End bp: 1155929
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 60%
IMG OID: 640776443
Product: ABC transporter related
Protein accession: YP_001313375
Protein GI: 150376779
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
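
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(1154796, 1155929)
print(length, protein_length(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (1134 bp) and Protein Length (377 aa).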


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.396966
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.0568321
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGGACAGC TCACTCTCAA CAAGGTTCAG AAATCCTACG GCACCTACGA GGTGTTGAAG 
AGCATCGAGC TCGAAGTCGG AAATGGCGAA TTCGTGGTCT TCGTCGGACC TTCCGGCTGC
GGCAAATCAA CACTGCTCAG GATGATCGCG GGGCTTGACG AGACGACCGC GGGTGACATC
GTCATCGATG GCAAGCGTGT TAACGATCTG CCGCCCGTCA GGCGCGGCAT AGCCATGGTC
TTCCAGTCCT ATGCTCTATA CCCGCATATG AGCGTGTTCG AGAACATCGC CTTTCCTCTA
CGGGTCGAGA AGATGCCCGA GGAGAAACTG AAGGCGAAGG TTCAGCATGC CGCCCGCATA
TTGCACCTCG ATCAGCGGCT CGAGCAGAAG CCGGGCATGC TGTCGGGCGG GCAGCGTCAA
CGTGTGGCGA TCGGCCGGGC AATCGTGCGC GAACCGAAGA TCTTCCTGTT CGACGAGCCG
CTGTCTAACC TCGATGCTGC CTTGCGCGCC GATATGCGCA TTGAGCTCGC GAAGCTGCAC
AGGCATTTGA AGGCGACGAT GATCTACGTC ACGCACGACC AGGTCGAGGC GATGACGATG
GCGGACCGGA TTGTCGTGCT GAACGCCGGA GAGATTGCGC AGACGGGAGC GCCGCTCGAG
CTTTATCACA AACCCGCAAA CATATTCGTC GCAGGATTTA TCGGAAACCC CAAGATGAAC
TTTCTGCCGG TCACCTGTAC AGGTGTAAAC GATGCCGGTG TGGAAGTGGA CTACAAGGGA
CAGACGATTC TCGTTCCGGT CGTACCGCGC GCGGGCATGA CCGGGCGAAC CCTGACGCTC
GGGGTGCGGC CGGAACATAT CCGGATGGGC GACGCCGACC TGACGCTGAC GGTGACCCCC
TCGGTCATCG AGCGTCTCGG CGCCCATACA GTGGCCTATG TGGCGCTTGA CGGGGAAGGG
GAGAACTATT GCGCCATGCT GCCGGGGACA CTCGCGATCC GCGCCGACCA ACGGGTCAAG
ACCGGCATCG GTGCCATCGA CTGCCACCTC TTCGACGAAA AGGGGATGGC CTTCGAGCGG
CGGGTAGAGA TGACCGACAT CGATATGTCG CACTTCGATC CGGCGGCGGC TTGA
 
Protein sequence
MGQLTLNKVQ KSYGTYEVLK SIELEVGNGE FVVFVGPSGC GKSTLLRMIA GLDETTAGDI 
VIDGKRVNDL PPVRRGIAMV FQSYALYPHM SVFENIAFPL RVEKMPEEKL KAKVQHAARI
LHLDQRLEQK PGMLSGGQRQ RVAIGRAIVR EPKIFLFDEP LSNLDAALRA DMRIELAKLH
RHLKATMIYV THDQVEAMTM ADRIVVLNAG EIAQTGAPLE LYHKPANIFV AGFIGNPKMN
FLPVTCTGVN DAGVEVDYKG QTILVPVVPR AGMTGRTLTL GVRPEHIRMG DADLTLTVTP
SVIERLGAHT VAYVALDGEG ENYCAMLPGT LAIRADQRVK TGIGAIDCHL FDEKGMAFER
RVEMTDIDMS HFDPAAA
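
The gene sequence begins with TTG while the protein begins with M, which is what one would expect under translation table 11, where TTG is among the alternative start codons and an initiator codon is read as methionine. The following is a rough sketch of that translation, not the method used to produce this record. The helper names (`clean`, `translate_cds`) are hypothetical, and the codon table is built from the amino acid string for the standard code, whose codon assignments table 11 shares.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; an initiator codon is translated as M.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(block.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codon read as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGGGACAGC TCACTCTCAA CAAGGTTCAG"))
```

Applied to the first 30 bases of the gene sequence, the sketch yields MGQLTLNKVQ, matching the first ten residues of the protein sequence.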