Gene Smed_4968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4968 
Symbol 
ID5318031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1481710 
End bp1482687 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content59% 
IMG OID640776750 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_001313682 
Protein GI150377086 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.565249 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0876264 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCC TGATCGCTGC GGCCGCAATG GCTGCCCTGT CGTTCTGCGG CATTGCCAGT 
GCGCAGGAAT ACAGCCTGCG GTTCTCGACC TCGCAGGTGA ATCCGAACGA GCCGATCATC
AAAGCGATGA AGACTTACGC CGAGCGTGTT GGTGAACGCT CCGGCGGGCG AATCGCGATC
ACCGTGATGA CGGGTGATCA GCTTGGTGCG CAAAAGAAGG TCAACGAGAT GGTCATGAGC
GGCGCGAGTC TGCTCAGTGC CACCGACTAT GGTCAGCTTG GCCAGTTCGT TCCGGATCTG
TCTATCCTTG CCGGTCCCTA TGTCTATCCG GATCTGGCCG CGACGGAGCG CCTCTTCGCA
TCGGATCTCT ACAAGGAACT TTCCGGCAAG CTGGAAGCGC GTGGTATCAA GATCATCATG
CCGAACGGCC TCTTCGGCTA CCGTCACATC ATTTCCAACA AGCCGGTTCG CTCGCCGGCT
GATCTCGCCG GCGTGACCAT TCGCGTACCC TCGTCGCCGA TCATGATGGC GACCTTCGGC
AACTACGGCG CAAGGCCGAC GGAATTGCCG TGGGGGGACG TCTACAATGC GCTTCAGACC
GGCGTCGTCG ACGCAGCCGA AGGGCCTTTC GGCTCAATAG CCGGGGCGAA ATTGAACGAG
ACCCGCAAAG TCATTTCGAA GACCGGCCAT CAGATCATGT TCACCGCCTG GGTAGCCTCC
AGCCAGTTCT TCAACGGCCT TCCCGAAGAC CTTCAAAAGA TCCTCCTCGA GGAAGGGCGG
GCGATCGCCA GTGAATTGAC GCAGATGACA CTGGAAACGG ATGACGCCTA TGCAAAGCAG
CTCTCTGCCT CCGGCGTCGA GATCGTGACC GATGTCGACA TTCCGGCTTT CATCGAGGCC
TCCCGGGCCG CCTACGACAA GGTTCCGAAT ATAACGCCCG GCATCTACGA GCAGGTACAG
AAGGCGATGA AGCAATAA
 
Protein sequence
MKTLIAAAAM AALSFCGIAS AQEYSLRFST SQVNPNEPII KAMKTYAERV GERSGGRIAI 
TVMTGDQLGA QKKVNEMVMS GASLLSATDY GQLGQFVPDL SILAGPYVYP DLAATERLFA
SDLYKELSGK LEARGIKIIM PNGLFGYRHI ISNKPVRSPA DLAGVTIRVP SSPIMMATFG
NYGARPTELP WGDVYNALQT GVVDAAEGPF GSIAGAKLNE TRKVISKTGH QIMFTAWVAS
SQFFNGLPED LQKILLEEGR AIASELTQMT LETDDAYAKQ LSASGVEIVT DVDIPAFIEA
SRAAYDKVPN ITPGIYEQVQ KAMKQ