Gene Smed_1442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1442 
Symbol 
ID5322294 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1522726 
End bp1523736 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content59% 
IMG OID640790385 
ProductTRAP dicarboxylate transporter, DctP subunit 
Protein accessionYP_001327123 
Protein GI150396656 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.201981 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000183061 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGATTA CCCGTAGAAC GATGCTGCTC GGCACCGGGG CAGTGGCCAT CGGGGCGTCG 
ATCCGCACAA AGGCTCATGC GGCGGACTTC ACCTACAAGC TGGCAAACAA CCTGCCGGTT
ACCCACCCGC TCAACATCCG CCTGAAGGAA GCGGCCGACA AGATTCTGGA GGAGACTGGA
GGTCGGCTGA AGATCAACAT CTTCCCGAGC AGCCAGCTCG GCAACGACAC CGAAACCCTT
TCGCAGCTTC GCAACGGCGC AACCGAGTTC TTCTCGCTCT CGCCGCTGAT CCTGTCGACC
CTTGTTCCCA ACGCCGCCAT CAGCGGTATC GGCTTCGCCT TCCCCGACTA CGACACCGTC
TGGAAGGCCA TGGACGGAGA GCTCGGTGCC TATGCACGCG GCGAGATCGA GAGAAAGGGT
CTCGTGCCCA TGGAAAAGAT CTGGGACAAC GGCTTTCGGC AGATCACCAG CTCGAATAAG
CCGATCAACA CGCCGGAGGA TCTCAAAGGA TTCAAGATCC GTGTGCCGCC GAGCCCGCTC
TGGCAATCCA TGTTCACAGC CTTCGGCGCC GCACCGACCA CGATCAATTT TGCCGAGGTC
TACACGGCGC TCAGCACCGG CACGGTGGAC GGCCAGGAAA ATCCGTTGGC AATCGCATCC
ACCGCCAAGC TTTATGAGGT TCAGAAGCAT TGCGCCATGA CCAACCATAT GTGGGATGGT
TTCTGGATGC TGGCGAACAA GCAGGCCTGG GAGAGACTGC CTGAAGACAT CAGGGAGATT
GCAGCCAGAA ACCTGAACGA ATCGGCCACG GCCCAGCGGA CCGATACCGC CAAGGTGAAC
GACACAGTCC GCGAACAACT GACGCAGAGT GGCATGACTT TCACCGATCC GGATAAGGCT
GCGTTCCGCA AAACGCTGAG GTCGGCGGGT TTCTATGCCG AATGGAAGGG CAAGTTCGGT
GACGAGGCGT GGGCCATCCT GGAAAAGGCG GTAGGGACCA GCCTGGGCTG A
 
Protein sequence
MKITRRTMLL GTGAVAIGAS IRTKAHAADF TYKLANNLPV THPLNIRLKE AADKILEETG 
GRLKINIFPS SQLGNDTETL SQLRNGATEF FSLSPLILST LVPNAAISGI GFAFPDYDTV
WKAMDGELGA YARGEIERKG LVPMEKIWDN GFRQITSSNK PINTPEDLKG FKIRVPPSPL
WQSMFTAFGA APTTINFAEV YTALSTGTVD GQENPLAIAS TAKLYEVQKH CAMTNHMWDG
FWMLANKQAW ERLPEDIREI AARNLNESAT AQRTDTAKVN DTVREQLTQS GMTFTDPDKA
AFRKTLRSAG FYAEWKGKFG DEAWAILEKA VGTSLG