Gene Smed_1767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1767 
Symbol 
ID5322625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1851013 
End bp1852011 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content60% 
IMG OID640790705 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001327437 
Protein GI150396970 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.473646 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0264882 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAC TCCTTGCTCC CCTCCTACTC GCTGCGGGCA TATGCCTTAT GGCGGGTTCC 
TCAGGAGCTG ACGAATGCGG AAACGTCACT ATTGCCGAGA TGGACTGGGC CTCGGCTGGC
GTTGCGGCTC GCGTGGACAG GCTCATCCTT GAAAACGGCT ATGGATGCAG CGTCAAGCTG
GTCCCCGGCG ACACTATGCC GACATTCTCT TCGATGAACG ATAAGGCAGA GCCCGACCTG
GTCCCGGAAC TCTGGATCAA CTCCGTGCGG ACGCCTTTCG ACGCCGCAGT CAAGGAGGGT
CGCCTCGTCG AAGGCGCCAG GGTCTTGAGC GAAGGCGGGG TAGAAGGCTG GTGGATTCCA
AAGTTCATCG CCGACGCACA CCCGAAGATC CGAACGGTGC AGGACGCTTT GAAGCACCCG
GAACTCTTTC CCTCACCCGA CGACCCCGCA AAGGGGGCAG TCCATAATTG CCCATCGAGC
TGGAACTGCC GGTTCTCCAC CGCCAACCTC TACAAGGCAC TCGAAGGGGA TAAGGCCGGC
TTCCAGCTTG TCGACCCAGG CTCGGCCGCC GGCCTCGACG GTTCTATCGC CGAAGCATTC
GAGAAGAAGA CCGGCTGGCT CGGCTATTAT TGGGCCCCCA CCGCAGTGCT CGGTAAATAC
GAGATGACGT TGTTGTCCTT CGGCGTCGAT CACGACAAGG CGGAGTGGGA CTCCTGCACC
GCCGTTCCGG ACTGCCCCGC CCCGAAGGTA AACTCCTATC CGGTCTCCGA TGTCTACACT
GTGGTGACCG GGTCCTTTGC CGACAGAGCA GGTAGCGCCA TGAATTATGT CGGGGCGCGG
CAATGGGAGA ACGCGACGAT CAACAAGGTT CTGGCCTGGA TGGATCAGAA CCAGGCCACG
AACGAAGACG CGGCCCGCTA CTTCCTCGGA AACTTTCAAG AGATATGGAC GCAGTGGGTG
AGTCCGGAAG TCGCCGAAAA AGTGAAGGCG GCGCTCTGA
 
Protein sequence
MKKLLAPLLL AAGICLMAGS SGADECGNVT IAEMDWASAG VAARVDRLIL ENGYGCSVKL 
VPGDTMPTFS SMNDKAEPDL VPELWINSVR TPFDAAVKEG RLVEGARVLS EGGVEGWWIP
KFIADAHPKI RTVQDALKHP ELFPSPDDPA KGAVHNCPSS WNCRFSTANL YKALEGDKAG
FQLVDPGSAA GLDGSIAEAF EKKTGWLGYY WAPTAVLGKY EMTLLSFGVD HDKAEWDSCT
AVPDCPAPKV NSYPVSDVYT VVTGSFADRA GSAMNYVGAR QWENATINKV LAWMDQNQAT
NEDAARYFLG NFQEIWTQWV SPEVAEKVKA AL