Gene Smed_1719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1719 
Symbol 
ID5322577 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1798777 
End bp1799883 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content61% 
IMG OID640790657 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_001327389 
Protein GI150396922 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.457663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATCGCC GTTCATTCAT CAAAAATGCG AGCATTGGCG GCCTGGGTGC GGCTGCCGCG 
TCGGCACTGG CTGCGCCAGC CGTTGCGCAG AGCAACCCGA AGGTCACGTG GCGCTTGACC
TCGTCCTTCC CGAAGTCGCT GGACACCATC TATGGTGGTG CGGAAGTCCT GTCGAAATAC
GTGTCTGAAG CCACTGACGG CAATTTCCAG ATCCAGGCCT TCGCAGCCGG CGAGATCGTA
CCCGGTCTTC AGGCTGCAGA TGCCGCGGCT GCCGGCACCG TCGAAGCGTG CCATACGGTC
GCCTATTATT ATTGGGGCAA GGATCCGACC TGGGCGCTCG GCGCAGCCGT GCCCTTCGCG
CTCAACGCCC GCGGCATGAA CGCCTGGCAC TATCATGGCG GCGGCATCGA CCTTTTCAAT
GAGTTTCTCG GCGCCCAGGG GCTCATCGGC TTTCCGGGCG GCAATACCGG CGTCCAGATG
GGCGGTTGGT TCCGCAAGGA GATCAAGACC GTCGCCGACA TGAAGGGGCT GAAGATGCGC
GTCGGCGGTT TCGCGGGCAA GGTCATGGAA CGTCTCGGCG TCGTGCCGCA GCAGCTCGCC
GGCGGCGATA TCTATCCGGC GCTCGAAAAA GGCACGATCG ATGCGGCCGA ATGGGTCGGC
CCCTATGACG ACGAAAAGCT CGGCTTCTAC AAGGTCGCGC CCTATTACTA CTATCCCGGC
TGGTGGGAAG GCGGTCCGAC CGTGCACGCC ATGTTCAACA AGGCCGCTTA CGAAGGACTC
CCGAAGGCCT ATCAGTCGCT CCTGCGGACG GCCTGCCAGG CCACCGACGC AAACATGCTG
CAGAAGTACG ACTATCTCAA CCCGGCCGCC ATCAAGCGTC TTGTTGCGGC GGGAGCGAAG
TTGAGCCCGT TCAGCCCGGA AATCCTGTCG GCCTGCTTCG ACGAGTCCAA CAAGGTCTAT
GCGGAAATGG AATCCTCCAA CCCCGCATTC AAGAAGATCT GGGATTCGAT CAAGGCCTTC
CGCGCCGAAT ATTTCCTCAA CGCCCAGATC GCCGAATACA ACTACGATAC CTTCATGATG
ATCCAGCAGC GCAACGGCAA GATCTGA
 
Protein sequence
MDRRSFIKNA SIGGLGAAAA SALAAPAVAQ SNPKVTWRLT SSFPKSLDTI YGGAEVLSKY 
VSEATDGNFQ IQAFAAGEIV PGLQAADAAA AGTVEACHTV AYYYWGKDPT WALGAAVPFA
LNARGMNAWH YHGGGIDLFN EFLGAQGLIG FPGGNTGVQM GGWFRKEIKT VADMKGLKMR
VGGFAGKVME RLGVVPQQLA GGDIYPALEK GTIDAAEWVG PYDDEKLGFY KVAPYYYYPG
WWEGGPTVHA MFNKAAYEGL PKAYQSLLRT ACQATDANML QKYDYLNPAA IKRLVAAGAK
LSPFSPEILS ACFDESNKVY AEMESSNPAF KKIWDSIKAF RAEYFLNAQI AEYNYDTFMM
IQQRNGKI