Gene Smed_3820 details

Gene Information

Locus tag: Smed_3820
Symbol:
ID: 5318012
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 273835
End bp: 275421
Gene Length: 1587 bp
Protein Length: 528 aa
Translation table: 11
GC content: 61%
IMG OID: 640775632
Product: ABC transporter related
Protein accession: YP_001312565
Protein GI: 150375969
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
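
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(273835, 275421)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.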


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAATGC AGTCGACTAT CATTGCCGAG CAAAAGATGC CCTCTGTCGA GGCGCAGGGC 
GGTGGGACGC CCTTCCTGGA AGTTCGCAAC CTCGAGAAGC GCTTCGGCGG CGTTCGCGCT
CTCAAGGGCG TGAGCTTCAC GATAGAACGC GGCCGTACCT ACCATCTCAT GGGCGAGAAT
GGATGCGGAA AGAGCACGCT CATCAAGATC ATCTCGGGCG CCCAGCCGGC CGATGGCGGT
GAGCTTGCGA TCAATGGCAA GAAAACCGAA GGCCTGACGC CGATCGGCGC CTTGGCGGCA
GGAATCGAGA CGGTTTATCA GGATTTGTCG CTTCTTCCCA ATCTGAGTGT AGAGGAAAAC
GTCGCGCTGA CTCAGCAGCT GGTCGCCTCC AACGGCAGGC TTGCGCGTCG TCTCGATCTC
AAGGTTCTGC GGGAAACGTC GATCCGTGCG CTGAAGGAGG TCGGCCTGCC GACGGAGCCC
GCCTTTCTTT CGACGCCGGT CGAAGAACTA CCCATCGCGA CGCGTCAGCT CATTGCCATT
GCTCGCGCTA TCGCTTCCGA TGCCGGTCTG GTCATTATGG ATGAGCCGAC GACCGCATTG
ACGCGGCGCG AAGTTGATAA CCTCATTCGC GTCGTTCACG GCCTGCATGC CAAAGGCGTG
TCGGTCCTTT TCGTGACGCA CAAGCTCGAC GAGTGCAAGG CCATCGGTGG GCAGGCAATC
ATCATGCGCG ACGGCCTCAA GGTCGCCGAA TGCGACGTTT CCACCCAATC GAAGAGCGAA
CTTGGCTTCT GGATGACGGG CAAGAAGTTG GACGAGACGC GTTATCGCGT CGACGCGCAC
GGGGACGAGA CGCTGCTTTC GGTCGAGCAA CTCGGGGGGA GCGGCTTCGA CAATGTCAGC
TTCGCGGTTG CCAAGGGCGA GATATTCGGG ATCACGGGAC TGCTGGATTC CGGGCGGAAC
GAGCTTGCGC TTTCGCTGGC CGGCGTAGAA CCGGCGCGGC GCGGCTCCGT CCTGATGGGC
GGCAAAGCGG CAGACCTGTC TTCTCCGGCC TCGGCAATTG CGGCCGGCAT CGGCTATGTC
CCGGAGGATC GCCTGTCGGA AGGGCTTTTT CTTGGAAAAT CCATCCGCGA AAACATCGTC
ATGGCGGTCC TCGACCGGCT CCGGGGCGCA TTCGGTCTGC TGGACAGCCG CCGGGCGAGG
GCACTGGCGC AGAAGACCGT CGACGATCTG CAGGTCGCGA CGCCGGATAT CGACAATCCC
GTCACGTCTC TCTCGGGGGG CAACCAGCAG AGGGTTCTGA TCGGCCGCTG GCTGACCATC
GAGCCGAGCC TGCTCATCCT GCACGGCCCG ACCGTCGGTG TCGATGTCGG GTCGAAGGAC
ACGATTTTTC GCATCATCCA GCGTCTTGCC GGAGATGGGA TGAGCGTCAT TATCATCAGC
GACGACCTGC CCGAGCTCCT ACAGAACTGC GATCGCGTCA TGGTCATGCG CAAGGGCCGG
GTCGCCGATG TCTTCACGGC TGAAGGGCTT GAGGAAGACG TAATCTACAA ATCGATGATG
GCCGAAGCCG GCCAAGGAGT TCAATAA
 
Protein sequence
MPMQSTIIAE QKMPSVEAQG GGTPFLEVRN LEKRFGGVRA LKGVSFTIER GRTYHLMGEN 
GCGKSTLIKI ISGAQPADGG ELAINGKKTE GLTPIGALAA GIETVYQDLS LLPNLSVEEN
VALTQQLVAS NGRLARRLDL KVLRETSIRA LKEVGLPTEP AFLSTPVEEL PIATRQLIAI
ARAIASDAGL VIMDEPTTAL TRREVDNLIR VVHGLHAKGV SVLFVTHKLD ECKAIGGQAI
IMRDGLKVAE CDVSTQSKSE LGFWMTGKKL DETRYRVDAH GDETLLSVEQ LGGSGFDNVS
FAVAKGEIFG ITGLLDSGRN ELALSLAGVE PARRGSVLMG GKAADLSSPA SAIAAGIGYV
PEDRLSEGLF LGKSIRENIV MAVLDRLRGA FGLLDSRRAR ALAQKTVDDL QVATPDIDNP
VTSLSGGNQQ RVLIGRWLTI EPSLLILHGP TVGVDVGSKD TIFRIIQRLA GDGMSVIIIS
DDLPELLQNC DRVMVMRKGR VADVFTAEGL EEDVIYKSMM AEAGQGVQ
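
The protein sequence can be spot-checked against the gene sequence by translating codons in frame. The sketch below is an illustration, not the method used to produce this record: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, which does not matter here because the gene starts with ATG), and the `translate` helper is a hypothetical name. Only the first and last blocks of the gene sequence are included.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence, copied from the record.
first_block = "ATGCCAATGC AGTCGACTAT CATTGCCGAG CAAAAGATGC CCTCTGTCGA GGCGCAGGGC"
last_block = "GCCGAAGCCG GCCAAGGAGT TCAATAA"

print(translate(first_block))
print(translate(last_block))
```

Both fragments are in frame because each preceding line of the gene sequence holds 60 bases, a multiple of three. The same helper can be applied to the full gene sequence to compare against the full protein sequence.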