Gene Smed_4889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4889 
Symbol 
ID5318051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1399372 
End bp1400373 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content61% 
IMG OID640776674 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001313606 
Protein GI150377010 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.64177 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAC TGCTTGCGTC TACATGTCTC ATGCTTTGCC TCACGGCGGG AGCATCGGCG 
TCGAATGCTG CCGAATGTGG GAGCGTCACC ATCGCCAGCA TGAACTGGCA GAGTGCCGAG
GTCCTCTCGA ACCTGGACAA GTTCATTCTC AACGAAGGTT ACGGGTGCAG CGCCGAGATA
ACGATTGGCG ATACCGTGCC GACAATTACC TCCATGGCGG AGAAAGGTCA GCCCGATATA
GCACCCGAAG CCTGGATCGA CCTCCTGCCC GACGTCGTCA AGAAGGGGCA GGACGACGGT
CGTATCGTCA CGGTCGGTTC CCCGTTGCCG GATGGCGGCG TGCAGGGCTG GTGGATTCCG
AAGTATCTTG CCGACGCCCA CCCGGATATC AAAACAATCG GCGACGCTCT GAAGCACCCC
GAGCTCTTCC CCGCCCCCGA GGATTCGAGC AAGGGCGCTC TGCTGAACGG ACCGCAGGGC
TGGGGCGGCA CAGTCGTGAC GACGCAGCTT TTCAACGCGT TCGACGGCGA GAAAGCCGGA
TTCACCCTGA TCGATACCGG CTCTGCCGCC GGCCTGGATG GCGCCATCGC CAAGGCGTAT
GAGCGCAAGG AAGGTCTTTT TACCTATTAC TGGTCCCCGA CTGCCCTCCT CGGCAAATAC
GAGATGGTCA AGCTCGAGCC CGGCGTTCCG CACGACTCGG CCGAGTGGAA GCGCTGCAAC
ACGGTAGCGG ATTGCCCCGA TCCCAAACCG AACGCATGGC CCGTCGACAC GATCGTGACG
CTGGTCGCCA AGCCCTTTTC CGAGCGGGTC GGCCCCGAGG TGATGGATTA TCTGACCAAG
AGGTCCTGGA GCAACGAGAC CGTCAGCAAG TTGATGGCCT GGATGACCGA CAATCAGGCA
AGCGGTGAAG AAGGTGCGAA GCGCTTCCTC GAAGAGAACC AAGACATGTG GTCGAAGTGG
GTCTCGCCCG AGGCCGCGGA GAAGATCAAA GCCGCGCTCT GA
 
Protein sequence
MNKLLASTCL MLCLTAGASA SNAAECGSVT IASMNWQSAE VLSNLDKFIL NEGYGCSAEI 
TIGDTVPTIT SMAEKGQPDI APEAWIDLLP DVVKKGQDDG RIVTVGSPLP DGGVQGWWIP
KYLADAHPDI KTIGDALKHP ELFPAPEDSS KGALLNGPQG WGGTVVTTQL FNAFDGEKAG
FTLIDTGSAA GLDGAIAKAY ERKEGLFTYY WSPTALLGKY EMVKLEPGVP HDSAEWKRCN
TVADCPDPKP NAWPVDTIVT LVAKPFSERV GPEVMDYLTK RSWSNETVSK LMAWMTDNQA
SGEEGAKRFL EENQDMWSKW VSPEAAEKIK AAL