Gene Smed_3836 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_3836 
Symbol 
ID5318544 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp291530 
End bp292501 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content60% 
IMG OID640775648 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_001312581 
Protein GI150375985 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0128517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAG GATTGATTTC GCTCGCGCTT GCGGGGCTGC TCATGGCTTC GCAAGCCATG 
GCTCAAGAGG CGCGCACGCT GCGCCTGGGG ATGCAAGGCA CGGCCGGGGA CCCGCAATTC
GAGGGTGTCA CCGAGGCCGC GCGCATCATC AAGGAAAAGT CCGGCGGTCG GCTGACGCTG
GAAATCTTCC CCAATTCGCA ACTCGGCACC TTTACCGAGA TGATGGAGCA GGTGACACTC
GGCGAACTCG ACTTCACGCT CAATCCATTC GGGGGCATGG ATGCCTGGGT TCCCCGGGCC
GTGTTGGCGA GTACTGCCTA TGTCGTCGGC GACTTCGAGC ATCTTCAAAA GATCATCGCC
TCGGACTGGG GCAAGGGGAT CGTCGACGAA TTGCGAACCG AGCACAAGTG GCGCATGGTC
GACTCCTGGT ATTTCGGAAC GCGGCACACG ACGGCAAAGA AGCCCATCGA AAAGCCTGCG
GATTTCAACG GCATGAAGCT GCGCGTACCG AATTCCGCGC CGCTTCTGAC CTGGGCGAAG
GCAATGGGCG CGAGCCCGAC CCCGGTCGCG TTCGCCGAAG TCTATCTGGC GCTCCAGACC
AATCAGGTGG ATGGTCAGGA AAACCCGCTG CCGATCATCG ACTCGATGAA ATTCACCGAG
GTGCAGACCC ATGTTTCGTT GACCGGGCAT CTGGTGCAGG ACCAGGTCGT CCTCATGTCG
GAGGATACGT GGAATGCGCT TGATCCCTCC GATCAGAAAC TCGTCATGGA GGCATTCGAG
GCTGGCGGGG CCCTCAACGA CAAGCTGGTT GCCAATAAGG AAACGAGTCT CGTCAGCGAT
TTTCGTGAGC GCGGAATCAC CGTGGTCGAA CCCGACAAGG CAGCTTTCCA GGAGGCGATG
AAGCCCGTCT ATGCCGATCT CGATGCGAGG TTCGGCGCGG GCACGGTACA GACGCTGCTC
GATCTCCGAT AA
 
Protein sequence
MKKGLISLAL AGLLMASQAM AQEARTLRLG MQGTAGDPQF EGVTEAARII KEKSGGRLTL 
EIFPNSQLGT FTEMMEQVTL GELDFTLNPF GGMDAWVPRA VLASTAYVVG DFEHLQKIIA
SDWGKGIVDE LRTEHKWRMV DSWYFGTRHT TAKKPIEKPA DFNGMKLRVP NSAPLLTWAK
AMGASPTPVA FAEVYLALQT NQVDGQENPL PIIDSMKFTE VQTHVSLTGH LVQDQVVLMS
EDTWNALDPS DQKLVMEAFE AGGALNDKLV ANKETSLVSD FRERGITVVE PDKAAFQEAM
KPVYADLDAR FGAGTVQTLL DLR