Gene Smed_2307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2307 
Symbol 
ID5323168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2386897 
End bp2387946 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content61% 
IMG OID640791245 
ProductABC transporter related 
Protein accessionYP_001327974 
Protein GI150397507 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR03415] choline ABC transporter, ATP-binding protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.77844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG CCGTCATTTT CAAGAATGTC GACATCATCT TCGGCAAAAA TCCGCAGATC 
GCAACGCAAA TGGTCGATCA GGGCAAGACG CGGGACGAGA TCGGTGCTGC CACCGGGCTG
GTCCTGGGGG TCGCCGGCGC TTCGCTGACC ATCAACGAGG GCGAGATTCT CGTTCTGATG
GGGCTGTCCG GTTCCGGCAA GTCGACGCTG CTCAGGGCCG TCAACGGCCT TGCGCCGGTT
GTGCGCGGCG AGGTCGAGGT GAAGACCGGG AACGGGGCTC TCAACCCTTA TCGCTGCAAC
GCCAAGTCTC TGCGGGACTT CCGCATGCAT ACGGTCTCGA TGGTGTTCCA GCAGTTTGCC
CTTCTGCCGT GGCGAAGCGT GGCGGACAAT GTCGGTTTCG GGCTCGAATT GGCAGGCGTA
GCCGATGCCG AACGGCGCAA GCGCGTCGAC GAGCAGCTTG AACTCGTCAA TCTTACGCAA
TGGGCGGATC GCAAGGTCAA CGAACTCTCA GGCGGCATGC AACAGCGCGT CGGCCTTGCC
AGGGCCTTTG CCACCGGAGC CCCTATCCTT CTGATGGACG AACCGTTCTC GGCACTCGAC
CCGCTGATCC GCACACGCCT TCAGGACGAA TTGCTCGAAT TCCAGCGGCG GTTAAAAAAA
ACGATCATCT TCGTCAGCCA CGACCTCGAC GAGGCCTTCC GCATCGGCAA CCGGATCGCC
ATCATGGAAG GTGGAAGAAT CATCCAGTGC GGAACGCCGC AGGAGATCGT GAGGAGCCCG
GCAAACCAGT ATGTCGCCGA TTTCGTCCAG CACATGAATC CGATTTCGAT GCTGACGGCG
AAGGATGTGA TGCAGAGCGG TGTCGGGCGA ACCGCTGCAA GTACCGGCGT CACGGCGACC
GCAAAGCCAA CCACGCCACT CGTCGATATT CTCGATGCCA TGTCGCGCCA GCCGGGCAGC
ATAGGTGTGG TCGACAACGG CGCGGTCGTC GGTACTATCG ACGCGCAGAA CATCGTCGAG
GGACTGACGC GCCACCGCAG CAAAAACTGA
 
Protein sequence
MTDAVIFKNV DIIFGKNPQI ATQMVDQGKT RDEIGAATGL VLGVAGASLT INEGEILVLM 
GLSGSGKSTL LRAVNGLAPV VRGEVEVKTG NGALNPYRCN AKSLRDFRMH TVSMVFQQFA
LLPWRSVADN VGFGLELAGV ADAERRKRVD EQLELVNLTQ WADRKVNELS GGMQQRVGLA
RAFATGAPIL LMDEPFSALD PLIRTRLQDE LLEFQRRLKK TIIFVSHDLD EAFRIGNRIA
IMEGGRIIQC GTPQEIVRSP ANQYVADFVQ HMNPISMLTA KDVMQSGVGR TAASTGVTAT
AKPTTPLVDI LDAMSRQPGS IGVVDNGAVV GTIDAQNIVE GLTRHRSKN