Gene Smed_3638 details

Gene Information

Locus tag: Smed_3638
Symbol:
ID: 5318180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 72174
End bp: 73700
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 62%
IMG OID: 640775451
Product: ABC transporter related
Protein accession: YP_001312384
Protein GI: 150375788
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
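
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(72174, 73700)
print(length)                  # 1527
print(protein_length(length))  # 508
```

Both values agree with the Gene Length and Protein Length fields of this record.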


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCACG GTAGCATGCA TCAGTCGGCA AGCGCGGCAC GTACGGACCA GCCGCTTCTG 
TCTCTCCGAA ACATCAATAT GACCTTCGGC GGCGTCAAGG CGCTTAAGAA TGTGACCTTC
GAGGTCCGAC CAGGCGAGGT GCATTGCCTC GCCGGTGAGA ACGGCTCCGG CAAGAGCACG
CTGATCAAGG TGATCACGGG CGTCTATCGG CCAGCGGAAG GGGCGATCAT CGAATATGAC
GGTGATGTCT ATCCGCATAT GTCGCCGGTC ACCGCGCAAG AGCGCGGCAT TCAGGTCATC
TGGCAGGACC TTGCACTTTT CCCGGAAATG AGCGTCGCGG AGAACATAGC GTTCCACGAA
GTCCTTGGCC GTCCGCGGCT GGTCGATTAC AGCCGCATGC GCCAGATTGC GATCGAAGCG
CTGAGCCGGC TCGGCATCAC GCTCGATGTG GATCTGCCGC TCAAGGAATA TGCGATCGCC
CAGCGTCAGA TCGTGGCGAT CGCCCGAGCG CTCGTCGGTG AGGCGAAAGT TGTCTTCATG
GATGAGCCGA CGGCGTCGCT GACGCAGTCG GAGACGGATT ATCTCCTCGA GATCGTTCGC
GGCCTGTCGG CCTCCGGCGT TGCGGTTGTC TTCGTCAGCC ATCGTCTGGC GGAGGTTCTT
GAGATTTCGA GCCGGATCAC CGTTCTGCGC GACGGTGCGC TTGTCGGCGT GTATCCCGCC
GACGGCATGA CGCAGTCGAA AATCACTGAA CTCATGACCG GCAAGACCTT CGATCAGCAC
GTGCGCGCAC GCGCGAAGGA CGACCAGCCG GTCGTGCTCG ACGTTCGCGG TCTTGGCAGT
CCGGGCCAGT TCGAAGATGT TTCACTGACC GTCCGCCGTG GTGAGACGGT GGGGATAACC
GGTCTCCTGG GGGCCGGGCG GACCGAATTG GCGCTCGCGC TTTTCGGCAT GCTGAAGCCT
ACATCTGGGA CGTTCAGCAT CGATGGCCGG GAGGCTCGCT TCGCTTCGAA CCGCGATGCG
ATCAAGGCCG GCGTCGCCTA TCTGTCGGAG GACCGATTAT CGCTCGGGCT CATTCAGCCG
CAGTCGATCG CCGACAATCT CGTGATCGCA TCGCTTCACA AGATTCTCTC CGGCGGCCTT
CTCGCCGATG ACCGTAAACG CAGCCTCGTC GCCCGCTGGA TCGCCGATCT CGGCGTCAAG
ATCGGCCATC AGGCCGACGC GATATCGACG CTTTCCGGCG GCAACCAGCA GCGGGTGGCG
ATCGCCAAAT GGCTGGCCAC CGATCCCAAG CTTTTGATCC TCGACTCCCC CACGGTCGGG
GTCGATGTCG GGGCGCGTGC CGGTATCTTC GACATCGTCG CCAAGCTCGC CGAGAGCGGG
CTTGCGATTA TTCTGATCTC GGACGAAGTG CCGGAAGTCT ACTTCAATGC CGACCGGGTG
CTGCACATGG CCCAGGGCCG CATCGTCGGC ATCTATGATC CCCACCAGAC GCGGTTGGAA
GAGATAGAGG CGGCCGTCTA TGCATAG
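
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be stripped before any computation. The following is a minimal sketch of how a GC fraction like the one in the GC content field could be computed. The helper names are hypothetical, and the demonstration uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGGGCCACG GTAGCATGCA")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.6
```

Passing the full cleaned gene sequence to `gc_fraction` gives the whole-gene value.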
 
Protein sequence
MGHGSMHQSA SAARTDQPLL SLRNINMTFG GVKALKNVTF EVRPGEVHCL AGENGSGKST 
LIKVITGVYR PAEGAIIEYD GDVYPHMSPV TAQERGIQVI WQDLALFPEM SVAENIAFHE
VLGRPRLVDY SRMRQIAIEA LSRLGITLDV DLPLKEYAIA QRQIVAIARA LVGEAKVVFM
DEPTASLTQS ETDYLLEIVR GLSASGVAVV FVSHRLAEVL EISSRITVLR DGALVGVYPA
DGMTQSKITE LMTGKTFDQH VRARAKDDQP VVLDVRGLGS PGQFEDVSLT VRRGETVGIT
GLLGAGRTEL ALALFGMLKP TSGTFSIDGR EARFASNRDA IKAGVAYLSE DRLSLGLIQP
QSIADNLVIA SLHKILSGGL LADDRKRSLV ARWIADLGVK IGHQADAIST LSGGNQQRVA
IAKWLATDPK LLILDSPTVG VDVGARAGIF DIVAKLAESG LAIILISDEV PEVYFNADRV
LHMAQGRIVG IYDPHQTRLE EIEAAVYA
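
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table by hand and assumes that translation table 11 uses the same codon-to-residue assignments as the standard code, with the two differing only in which codons may act as start codons. Since this gene begins with ATG, that difference does not matter here. The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last three blocks of the gene sequence shown above.
print(translate("ATGGGCCACG GTAGCATGCA TCAGTCGGCA"))  # MGHGSMHQSA
print(translate("GAGATAGAGG CGGCCGTCTA TGCATAG"))     # EIEAAVYA
```

The two outputs match the first ten and last eight residues of the protein sequence above.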