Gene Smed_2583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2583 
Symbol 
ID5323451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2684111 
End bp2685145 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content59% 
IMG OID640791526 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001328248 
Protein GI150397781 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.201596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCA GCACAATGAA ACTCACCGTC GCGGCAGCCG GCCTGATGCT GGCGGCATCC 
GCCGGCGGCG CCAATGCTTC CTATTGCGGC GATGGCAAGA CCGTAACCTT CGCCGGCATA
GATTGGGAAA GCGGCGCCTT CATTACCGAG GTCATGAAAA CGATCCTTTC CAAGGGATAC
GACTGCCAGG TCGACTCGAT TCCCGGCAAT TCCGTAACTC TCGAGCAGGC AACGGCCAAT
AACGACGTTC AGATTTTCGC CGAGGAATGG CTCGGCCGTT CGGACATCTG GAACAAGGCG
GTGGAAGAGA AGAAGGTGGT TGCCGTCGGC AAGACCTTTG TAGGCGCCAG CGAAGGCTGG
TTCGTGCCGG ACTATGTCGT CAAGGGCGAT CCCGCTCGCA ATATCGAACC CAAGGCGCCG
GACCTGAAAA GCGTATCCCA ACTTGCCGAC CCGAAAATCG CCGAGATCTT CGCTGATCCG
GAGGAACCGT CCAAGGGCCG CTTCCTGAAC TGCCCTTCCG GCTGGACCTG CGAGGGCGTG
AGCACGGCCA AGCTCGAGGC CTACAAGCTT GGCGAGTCCT ACGTGAACTT CCGCCCGGGG
ACGGGAACGG CGCTCGATGC GGCGATTACC TCTGCCTATC TCCAGGGAGA GCCGATCCTC
TTCTATTATT GGTCGCCGAC CGCGATCATG GGTAAATACA AGCTGATCCA GCTCGAAGAG
CCGCCCTATA ACGAAGCCTG CTGGAAAGAA CTGAGCAGTG CCAACGGCAA GCGTGACGAA
GGCTGCGCCT TTCCCTCCGT CGACGTCTCT TATGGCGTGA ACAGCACCTT CGCTTCCGAG
GCGCCGGAAA TCATCGAGAT CCTGGAAAAG GCAACTTTTC CGCTCGAGGA GGTCAATGCC
AGCCTCGCCT ATATGACGGA TAATAAGGTC GATGCGACGG CAGCCGCCGC GCAGTTCCTG
AAAGCCAAGG GCGACATCTG GGGCAAGTGG GTCTCGGACG AGGCACGCGG CAAAATCGAA
GCTGGCCTCG AATAA
 
Protein sequence
MSISTMKLTV AAAGLMLAAS AGGANASYCG DGKTVTFAGI DWESGAFITE VMKTILSKGY 
DCQVDSIPGN SVTLEQATAN NDVQIFAEEW LGRSDIWNKA VEEKKVVAVG KTFVGASEGW
FVPDYVVKGD PARNIEPKAP DLKSVSQLAD PKIAEIFADP EEPSKGRFLN CPSGWTCEGV
STAKLEAYKL GESYVNFRPG TGTALDAAIT SAYLQGEPIL FYYWSPTAIM GKYKLIQLEE
PPYNEACWKE LSSANGKRDE GCAFPSVDVS YGVNSTFASE APEIIEILEK ATFPLEEVNA
SLAYMTDNKV DATAAAAQFL KAKGDIWGKW VSDEARGKIE AGLE