Gene Smed_2503 details


Gene Information

Locus tag: Smed_2503
Symbol:
ID: 5323370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 2598598
End bp: 2600865
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 63%
IMG OID: 640791445
Product: phosphoenolpyruvate-protein phosphotransferase PtsP
Protein accession: YP_001328168
Protein GI: 150397701
COG category: [T] Signal transduction mechanisms
COG ID: [COG3605] Signal transduction protein containing GAF and PtsI domains
TIGRFAM ID: [TIGR01417] phosphoenolpyruvate-protein phosphotransferase
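
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive (which is consistent with the listed 2268 bp) and that the CDS ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2598598
END_BP = 2600865


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 2268 755
```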


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.118134
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGACC TTTCCGCAGG TCCGCGCGTT CTCCTCAAGC GGTTGCGCGA GCTCATGGCG 
GAACCGCTCG AGCCGCAAGA GCGCCTTGAC CGGATCGTGC GCCAGATAGC GCAGAACATG
GTCGCGGAAG TGTGCTCGGT TTACGTGCTG CGCTCTGATG GCGTGCTGGA ACTCTACGCG
ACCGAGGGCC TGAACAAGAC GGCGGTGCAC CTGGCCCAGT TGAAGATGGG GCAAGGTCTC
GTCGGCACGA TCGCCGCTTC GGCGCGGCCG CTCAACCTCT CCGATGCCCA GTCGCACCCG
GCCTTCACCT ATCTGCCGGA GACGGGCGAA GAGATCTATC ATTCCTTCCT GGGCGTTCCG
ATTCTACGCA CCGGCCGTGC GCTCGGGGTG CTGGTCGTGC AGAACAAGGC GCAGCGCAAC
TATCGCGAGG ACGAGGTCGA GGCGCTGGAA ACGACCGCCA TGGTACTGGC CGAAATGGTC
GCGGCCGGCG AGCTCAAGAA GATCACCAAA CCCGGGCTGG AACTCGATCT TTCGCGTCCG
GTGACGATCG ATGGCAATAG CTACGGCGAA GGCATCGGGC TCGGTTACGT CGTTCTGCAT
GAGCCGCGCA TCGTCGTTAC CAATCTGCTC AACGAAGACA CGGAGCAGGA ACTGCAGCGT
CTTGCAGAGG CGCTCGGGTC ACTGCGCATC TCGATCGACG ACATGCTGTC GCGCCGCGAC
GTATCGATGG AGGGCGAGCA CCGCGCGGTG CTCGAGACCT ATCGCATGTT CGCCCATGAC
CGCGGTTGGG TGAGAAAACT CGAGGAGGCG ATTCGCAACG GTCTGACGGC GGAAGCGGCG
GTCGAGCGGG TGCAGAGCGA AACCAAGGCC CGCATGATAC GCCTGACCGA TCCCTATCTC
AGGGAACGGA TGCACGATTT CGACGATCTT GCGAACCGGC TGTTGCGGCA GCTTTCCGGT
TACGGCGCCA AGCGTTCGGC TGCCGACTTC CCAAATGACG CAATTGTCGT AGCCCGTGCC
ATGGGCGCCG CGGAGCTACT CGACTATCCG CGAGAAAACG TGCGCGGTCT CGTGCTGGAA
GAGGGCGCCG TCACCAGTCA TGTGGTGATC GTCGCCCGGG CCATGGGCAT TCCGGTCGTG
GGACAGGCGG CAGGCGCAGT GGCGCTTGCG GAAAACCGAG ATGCGATCAT CGTCGACGGC
GACGATGCGA AGGTGCATCT GCGGCCAATG GCGGACCTCC AGCGCGCATA CGAGGAAAAG
GTACGTTTCC GCGCGCGCAG ACAGGCGCAG TTCCGTGCAC TCAGGGACGT CGAACCGCTG
ACGAAGGACG GCAAGCGCAT CACCTTGCAG ATGAATGCCG GCCTGCTGGT CGACTTGCCC
CATCTGAACG AAGCCGGTGC CGAAGGTATC GGGCTGTTCC GTACCGAGCT GCAATTCATG
ATCGCTTCGA CCATGCCGAA AGCCGAGGAG CAGGAGGTCT TTTATCGCAA TGTCTTGAAG
CAGACAGGGA CCAAGCCGGT GACCTTCCGG ACCCTGGATA TCGGCGGCGA CAAGGTCGTC
CCCTATTTCC GGGCAGCGGA TGAGGAGAAT CCGGCACTCG GCTGGCGCGC CATCAGGCTG
TCGCTCGACC GGCCGGGCCT CCTGCGCACC CAGTTCAGGG CTATGCTCAA GGCGACGGCC
GGCGCCGAGC TCAAGATGAT GCTGCCGATG GTAACGGAAG TGACCGAGCT CAAGGCCGCG
CGCGGACTTC TGCAAAAGGA GATCGAGCGT CAGTCGAAGC TCGGCGAACA ATTGCCGCGC
AAGCTGCAGT TCGGGGCGAT GCTGGAAGTG CCCGCCCTGC TTTGGCAGCT CGACGAATTA
ATGGCCGAGG TCGATTTCGT TTCGGTCGGC TCAAACGATC TCTTCCAGTT CGCCATGGCT
GTCGACCGCG GCAATGCCCG TGTTTCCGAT CGCTTCGATG TTCTCGGCCG GCCGTTCCTG
CGGATACTCC GGGAGATCGT GCGTGCTGGC GAACGCAACT GCACACCCGT GACCCTTTGC
GGGGAAATGG CGAGCAAACC GCTTTCCGCC ATGGCGCTGC TCGGACTTGG TTTCCGGTCG
GTTTCGATGT CGCCGACGGC TGTCGGGCCG GTGAAGGCGA TGCTTCTGGC GCTCGACGCC
GGCAGGCTCA ACGAACGCCT CGAGGCTGCG CTCGACGACG TCAAATCCGA CGCGTCGATC
CGCCAGCTCC TGGTCGATTT CGCTGCCGCG AACGGCATAC CGGTTTAG
 
Protein sequence
MRDLSAGPRV LLKRLRELMA EPLEPQERLD RIVRQIAQNM VAEVCSVYVL RSDGVLELYA 
TEGLNKTAVH LAQLKMGQGL VGTIAASARP LNLSDAQSHP AFTYLPETGE EIYHSFLGVP
ILRTGRALGV LVVQNKAQRN YREDEVEALE TTAMVLAEMV AAGELKKITK PGLELDLSRP
VTIDGNSYGE GIGLGYVVLH EPRIVVTNLL NEDTEQELQR LAEALGSLRI SIDDMLSRRD
VSMEGEHRAV LETYRMFAHD RGWVRKLEEA IRNGLTAEAA VERVQSETKA RMIRLTDPYL
RERMHDFDDL ANRLLRQLSG YGAKRSAADF PNDAIVVARA MGAAELLDYP RENVRGLVLE
EGAVTSHVVI VARAMGIPVV GQAAGAVALA ENRDAIIVDG DDAKVHLRPM ADLQRAYEEK
VRFRARRQAQ FRALRDVEPL TKDGKRITLQ MNAGLLVDLP HLNEAGAEGI GLFRTELQFM
IASTMPKAEE QEVFYRNVLK QTGTKPVTFR TLDIGGDKVV PYFRAADEEN PALGWRAIRL
SLDRPGLLRT QFRAMLKATA GAELKMMLPM VTEVTELKAA RGLLQKEIER QSKLGEQLPR
KLQFGAMLEV PALLWQLDEL MAEVDFVSVG SNDLFQFAMA VDRGNARVSD RFDVLGRPFL
RILREIVRAG ERNCTPVTLC GEMASKPLSA MALLGLGFRS VSMSPTAVGP VKAMLLALDA
GRLNERLEAA LDDVKSDASI RQLLVDFAAA NGIPV
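
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the database's own method: it builds the codon table by hand from the standard genetic code (translation table 11 uses the same sense-codon assignments), strips the 10-character block spacing used in this record, and stops at the first stop codon. It does not handle alternative start codons, which is not an issue here because the gene starts with ATG. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence blocks."""
    return "".join(seq_block.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with their original spacing.
first_bases = "ATGAGAGACC TTTCCGCAGG TCCGCGCGTT"
print(translate(first_bases))  # prints: MRDLSAGPRV
```

Passing the full gene sequence block to `translate` in the same way allows the result to be compared with the listed protein sequence.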