Gene Smed_0538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0538 
Symbol 
ID5321372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp581506 
End bp582612 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content64% 
IMG OID640789472 
Productextracellular ligand-binding receptor 
Protein accessionYP_001326229 
Protein GI150395762 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0513259 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.872247 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTAT CATTATTGAG GGGAATGACC TTGGCCGCCG GCGTCGCTTT CGCGCCGCTC 
GCCCACGCCG ACATCACCAT CGGCGTCATC ACGCCGCTCA CCGGTCCCGT TGCGGCCTTT
GGCGAGCAGG TCAAGAACGG CGCCGAGGCG GCAGTCGAAG CGATCAACAG CGCCGGCGGC
GTCAATGGGG AGAAGCTCGT CCTCAAGATC GTCGACGACG CGGGTGAACC CAAGCAGGCC
GTTTCCGTCG CCAACCAGCT GGCGGGCGAA GGCGTACGAT ACGTCGTCGG TCCGGTGCTC
TCCGGTACGT CGATGCCGGC ATCCGACGTA CTGGCGGAAA ACGGAATCCT CATGGTCACG
CCGACGGCGA CCACGCCCGA CCTTACCACC CGTGGTCTGT GGAACGTGCT GCGCACTTGC
GGACGCGACG ATCAGCAGGC GGTCGTCGCC GCCGATTACG TCGTCAAGAA CTTCAAGGAC
AAGCGCGTCG CGGTGCTGCA CGACAAGGGC GCCTATGGCA AGGGCCTTGC CGACGGCTTC
AAAGCTGCGA TCAACGCAGG CGGCATTACC GAGGCGGTCT ATGAAGGCCT GACGCCGGGC
GAGAAGGATT TCGGGGCGAT CGTCACCCGC CTGAAGGCCG AGAAAGTCGA CGTCGTCTAT
TTCGGCGGCT ACCATGCAGA AGGCGGGCTG CTCGCTCGTC AGATGCATGA CCAGGGTGTC
AAGGCACAAC TCCTCGGCGG TGACGGCCTG TCCAACACCG AGTACTGGGC AATCGGCGGC
GAAGCCGCAA CCGGCACCAT CTACACCAAT GCAAGCGATG CCACGCGCAA CCCGGCCGCC
GCACCGGTAA TCGAGGCTCT CAAGGCCAAG AACATTCCGG CGGAAGCTTT CACGCTCAAC
GCCTATGCCG CCGTTCAGGT CCTCAAGGCA GGCATCGAGA AGGCCGGTTC GACCGAAGAT
GCGACCGCGG TGGCCACCGC CATAAAGTCC GGCGAGGCCA TCGACACCGT CATCGGAAAG
CTGACCTATG GCGAAAGCGG CGATCTCACC TCGCCGAGCT TCTCGCTCTA CAAGTGGGAA
GGCGGACAGA GCGTCGCGGT CGAATAA
 
Protein sequence
MRLSLLRGMT LAAGVAFAPL AHADITIGVI TPLTGPVAAF GEQVKNGAEA AVEAINSAGG 
VNGEKLVLKI VDDAGEPKQA VSVANQLAGE GVRYVVGPVL SGTSMPASDV LAENGILMVT
PTATTPDLTT RGLWNVLRTC GRDDQQAVVA ADYVVKNFKD KRVAVLHDKG AYGKGLADGF
KAAINAGGIT EAVYEGLTPG EKDFGAIVTR LKAEKVDVVY FGGYHAEGGL LARQMHDQGV
KAQLLGGDGL SNTEYWAIGG EAATGTIYTN ASDATRNPAA APVIEALKAK NIPAEAFTLN
AYAAVQVLKA GIEKAGSTED ATAVATAIKS GEAIDTVIGK LTYGESGDLT SPSFSLYKWE
GGQSVAVE