Gene Smed_4748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4748 
Symbol 
ID5319147 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1269479 
End bp1270456 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content58% 
IMG OID640776546 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_001313478 
Protein GI150376882 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00048921 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00325994 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAATCC TGGTAAAACT GGCGGCGGGT CTAGTAGTCG CCGCTGCATT CATGGGCAAT 
GCAGCCAACG CCCAGACGGT GCTGCGCTCA TCCGACACGC ATCCGGACGG CTATCCGACG
GTCGAGGCGG TCGAGTACTT CGGTGAGCTG GTCAAGGAGC GTACGGCCGG CCGCTACTCC
GTCGAGGTCT ATCACTCCGC GCAACTCGGG GAGGAAAAGG ACACGATCGA GCAGGTGCGT
TCCGGCGTCA TCGAGCTGAA CCGCGTCTCG ATGGCCCCCT TCAACGGTAC GGTGAAGGAA
TCGATCGTTC CGGCGCTTCC CTACCTCTTC CGTTCGGAAG AGCACATGCA CAAGGTGATG
GACGGGGCGA TCGGCGACCA GATCAAGACG GCCTTCGAAA GCGCCGGAGT GGTGGTGCTC
GCCTTCTATG ACGCTGGCGC GCGTTCCTTC TACAACAAAC AGAAGCCGAT CAGTTCGGTT
GCCGACATGA AAGGCTTGAA GTTCCGCGTG ATCCAGTCCG ACATCTTCGT GGACATGGTG
GCCGCGCTCG GGGCGAACGC TACGCCCATG CCTTACGGTG AAGTCTATTC CGGAATCGAA
ACGGGCGTCA TCGACGGCGC GGAGAACAAT TTTCCAAGCT ACGACACCGC CAAGCATTTC
GAAGTTGCCA AGAACTATTC GCTCGACGAA CACACCATCC TTCCGGAGGT ATTCGTCATG
AACAAGGCCG TCTTCGATAA ACTCACGCCG GAAGATCAGG AGATATTCAA GCAGGCCGCA
AAGGACAGTG TCGCCAAACA GCGCGAGCTC TGGGCTGCCA AGGTCAAGGA GTCGCGTGGG
AAGGTCGAAG CGGCCGGCGC GCAGATCACC ACACCCGAAA AGCAGGGTTT CATCGATGCA
ATGAAGCCGG TCTACGAAAA GCACGTTACC GATGCCGTCC TGAAGAAAAT GGTTGAGGAC
GTGCGCGCGG TACAGTGA
 
Protein sequence
MKILVKLAAG LVVAAAFMGN AANAQTVLRS SDTHPDGYPT VEAVEYFGEL VKERTAGRYS 
VEVYHSAQLG EEKDTIEQVR SGVIELNRVS MAPFNGTVKE SIVPALPYLF RSEEHMHKVM
DGAIGDQIKT AFESAGVVVL AFYDAGARSF YNKQKPISSV ADMKGLKFRV IQSDIFVDMV
AALGANATPM PYGEVYSGIE TGVIDGAENN FPSYDTAKHF EVAKNYSLDE HTILPEVFVM
NKAVFDKLTP EDQEIFKQAA KDSVAKQREL WAAKVKESRG KVEAAGAQIT TPEKQGFIDA
MKPVYEKHVT DAVLKKMVED VRAVQ