Gene Smed_4768 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4768 
Symbol 
ID5318492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1290029 
End bp1291540 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content62% 
IMG OID640776566 
ProductABC transporter related 
Protein accessionYP_001313498 
Protein GI150376902 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.957059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.347139 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATGGTT CCGCCGCGGA CACCATCCTC AAGATCACCG ACGTTACCAA GTCCTTCGGG 
CAGGTCGCTG CCCTCAAGGG GATGCGGCTC GAAGTACGGC GCGGCCGGGT GCACACACTG
CTCGGTGAGA ATGGTGCCGG CAAATCGACA TTGATGAAGA TTCTCGCCGG CGTGCACACG
GCAACCTCGG GAGAAATCAC CCTCGACGGG CAGGCTTACC GCCCCGCGAG CCCGCAGGAT
GCGGCCTCGC TCGGACTTGC CATCGTCTTC CAGGAATTGA GCCTCTGCAA CAATCTCACG
GTGGCGGAGA ATATCCTCGC CACGCGCGAG CCACGTCGTT TCGGATTCAT CAACGACAAG
GCGCTCGTCG CACAGGCGCG CCGGATCGTG GCCGACCTTG GACTCCCGAT CGACGTCACC
GAGAAGGTCG GCAATCTCTC GATCGCCCAA CGGCAGCTCG TCGAGATCGC CAAGGGCCTG
AGCCACGACG CGGAGGTCGT CATTCTGGAT GAACCGACCT CCTCCCTCAG TGACAGCGAG
GCGGAGATCC TGTTCGCGAT CATCGCCCGG CTGAAAGAGC GTGGCGTTGC GATCATATAC
ATCTCGCACC GCATGGAAGA GATCATGCGG CTCAGCGACG ACATCACGGT CATACGCGAC
GGCGAGTATG TTTCCACGCA TGCGCGCGAA GAGGTGACCA TCGAGGCACT GATCGCCCTG
ATGGTCGGGC GACGCATGGA CGAGATCTAC CCGCCGGCGG TTCATGCGGT CGCAAGGGAT
AAGGCCCCTG TCCTTTCAGT CGAGCGCTTG ACACGCGAAG GCGAGTTTCA GGACGTCTCC
TTCGAGGTAC GCGCTGGCGA GATCCTGGGC TTCTTCGGCC TTGTCGGATC GGGCCGCTCG
GAAGTGATGA ACGCGATCTT CGGCATGAAA AACGCCAGCG GCGCTGTGCG TCTCAACGGC
GAGGTCGTGC GGTTCCGCTC GCCGGACGAA GCCATCGCCC GGCGCGTCGG CTTCGTGACA
GAGAACCGAA AGGAAGAAGG TCTCGTCCTC GGCCACAGCG TCGAGTGGAA CATATCCATG
GCTGCACTCG GGGACTTCAC CGGTGGTTTC GGTTTCATCC GCAACGGCGC GGAACGAGCC
GCGGCATCCG CACAGGTCGG CAATCTCTCG ATCAAGACGA ACTCGCTCGA CACGCCATCC
GGTGCGCTCA GCGGCGGCAA TCAGCAGAAG ATCGTGATTG CCAAATGGCT TCTCACGCGG
CCCAGAGTGC TGATCCTCGA TGAGCCGACC CGCGGCGTCG ACGTCGGAGC CAAGTTCGAA
ATCTACAAGA TAATCCGTCA GCTGGCAGCG GAGGGAACGG CAATTCTGTT GATCTCCTCC
GATCTGCCCG AAGTTCTGGG AATGAGCGAC CGCGTTGTCG TCATGCATGA GGGCGCGCCG
GGAGCGACGC TCGAAGGCCC CGACCTCACT CCAGAGACGA TCATGGCTCA CGCGACAGGT
TTTCAATCAT GA
 
Protein sequence
MHGSAADTIL KITDVTKSFG QVAALKGMRL EVRRGRVHTL LGENGAGKST LMKILAGVHT 
ATSGEITLDG QAYRPASPQD AASLGLAIVF QELSLCNNLT VAENILATRE PRRFGFINDK
ALVAQARRIV ADLGLPIDVT EKVGNLSIAQ RQLVEIAKGL SHDAEVVILD EPTSSLSDSE
AEILFAIIAR LKERGVAIIY ISHRMEEIMR LSDDITVIRD GEYVSTHARE EVTIEALIAL
MVGRRMDEIY PPAVHAVARD KAPVLSVERL TREGEFQDVS FEVRAGEILG FFGLVGSGRS
EVMNAIFGMK NASGAVRLNG EVVRFRSPDE AIARRVGFVT ENRKEEGLVL GHSVEWNISM
AALGDFTGGF GFIRNGAERA AASAQVGNLS IKTNSLDTPS GALSGGNQQK IVIAKWLLTR
PRVLILDEPT RGVDVGAKFE IYKIIRQLAA EGTAILLISS DLPEVLGMSD RVVVMHEGAP
GATLEGPDLT PETIMAHATG FQS