Gene TM1040_1933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1933 
Symbol 
ID4076884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2033657 
End bp2035897 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content59% 
IMG OID638007249 
Productphosphoenolpyruvate-protein phosphotransferase PtsP 
Protein accessionYP_613928 
Protein GI99081774 
COG category[T] Signal transduction mechanisms 
COG ID[COG3605] Signal transduction protein containing GAF and PtsI domains 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.940648 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.328925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGAGA ACACGGAATC CGAAAGCCGC AAGCTTCTGG GTCGTCTGCG CGAGGCCATG 
GCAGGGGATG ACGCCGGACA GGCGCGCCTC GACAAGATTA CGCAGCTGAT AGCCGACAGT
ATGGGCACCG AAGTGTGCTC TATCTATCTG TTTCGCGACG AAGATACGCT CGAACTCTGC
GCCACCGAAG GGCTCAACCG CGAGTCCGTG CACCAGACCC GCATGCGCGT GGGCGAGGGG
CTTGTGGGGC GTGTGGCGCG CACGGGCAAG GTGATCAACA CCCCCGATGC GCCAAGCGCG
CGGGGCTTTC GCTATATGCC GGAAACCGGC GAGGAGCGCT TCTCCTCATT CCTGGGTATC
CCCATCCAGC GCCTTGGCGA GATGATGGGC GTTCTTGTGG TGCAATCCAA GGAAAGCCGC
GAATATTCCG CGGACGCGGT CTATGCCCTT GAAGTTGTTG CAATGGTTAT CGCCGAGATG
ACCGAACTTG GGGCCTTTGT GGGCGAGGGC TCTGCCATGT CGCCCCTGCA CCAGCAATCT
GTGCTCTTGA AGGGTATCGC CGCGCAGGAG GGGGCCGCCG AGGGCAATAT CTGGCTGCAT
GAGCCCCGCG TTGTGGTGAC GAACCCGATT GCGGACGATC CGCATCGGGA GCTGGAGCGC
CTGCAGGAGG CCGTTGAGGA GCTGCGGGTG GGTGTCGACA AGATGCTGCA GGTCTCATCG
AACGGTGACA AGGAACAGGC ACAGGTCCTT GAAACTTATC GCATGTTTGC CAACTCCAAA
GGCTGGATGC GGCGCATGGA AGAGGACATT GCCCGTGGCC TGAGCGCCGA AGCTGCAGTT
GAAAAAGAAC AATCCCTTGC AAGAGGCCGC ATGGCGCAGG TTCAGGACGC GTATTTGCGT
GAGCGCCTGA GCGATCTGGA CGATCTGTCG AACCGCCTGC TGCGTATTTT GACCGGTCAG
GGCAAGGATA CCGGCGCAGA ACTGCCGGAA AATCCCATCC TGGTCGCGCG CAATATCGGT
CCGGCCGAAT TGTTGGAATA TGGTAGGTCT TTGAAGGGAA TTATCCTTGA AGAGGGCTCT
GTAGGTTCGC ATGCGGCAAT TGTAGCGCGG GCATTGGCGA TCCCATTGGT GGTGCGTGTG
GCCCGGATCA CTACCGAAGG GCTGAATGGG GATCATGTTT TGGTGGACGG GGACAACGCT
GTTGTGCACC TGCGCCCCGA TGATCCTGTC GTCAGCGCCT TTCGCGATAA GATTGCGATG
CAAGCGAAGG CCCAGGAACG GTATTCGTCG ATACGTGAGA AACCTGCGAT AACGGTCGAT
GGAAAAAGAA TTAACCTGAT GATGAACGCA GGGTTGATGG CAGATCTGCC TTCGCTCGAG
AATTCTGGCG CAGAGGGGGT TGGTCTGTTT CGCACTGAGC TGCAGTTCCT GGTGCGCAAT
CAGATGCCCA AACGCTCCGA GCTCAAGGGG CTCTACCAGC GTGTGCTGGA GGCTGCAGGT
GGCAAACGGG TGGTGTTTCG CACCCTCGAC ATCGGGTCCG ACAAGGTTCT GCCCTATATG
AAGCCCACCG ACGAGCCGAA CCCAGCCTTG GGATGGCGCG CAATTCGGGT TGGATTGGAC
AAACCCGGTG TGATGCGGAT GCAGCTACAG GCGCTGATCC GCGCGGCCAA CGGCGGGCCT
CTTACGGTGA TGTTCCCATT TGTGGCACAA TTCGAGGAAT ACCGCCTCGC GCGCGAGGAG
GTGAACAAGA CCATCGAACG CGAGCGCCGT TTGGGACATG TTCTGCCCTC CACGCTCGAG
ATCGGCGCCA TGCTGGAGAC CCCATCGCTT GGGTTTGCAC CGCTCAAGTT CTTTGAGGAA
GTCGATTTTC TGTCGATTGG CGGCAATGAT CTCAAGCAGT TCTTCTTTGC GGCGGACCGT
GAAAACGAAC GGGTGCGACG GCGCTATGAC ACGCTCAACG TGAGCTTCCT GAGCTTTATC
GAGCGGATCG TGGAACGCTG CGACATCTCC GGCACGCCGC TGAGCTTTTG TGGTGAGGAT
GCCGGCCGCC CGATCGAAGC CGTGTGTCTT GCAGCCATGG GAATTCGCAC CCTTTCGATG
CGTCCGGCCT CGATCGGCCC GGTCAAATCT CTGCTGATGC GCGTGGATCT TGGGGAACTG
CGCAAGATCA TCTCTGATGC GCGTCATCGC GGTGATCAAA CGGTGCGACC GGCGGTCATG
CAGTACCTCC GCGAACTTTG A
 
Protein sequence
MVENTESESR KLLGRLREAM AGDDAGQARL DKITQLIADS MGTEVCSIYL FRDEDTLELC 
ATEGLNRESV HQTRMRVGEG LVGRVARTGK VINTPDAPSA RGFRYMPETG EERFSSFLGI
PIQRLGEMMG VLVVQSKESR EYSADAVYAL EVVAMVIAEM TELGAFVGEG SAMSPLHQQS
VLLKGIAAQE GAAEGNIWLH EPRVVVTNPI ADDPHRELER LQEAVEELRV GVDKMLQVSS
NGDKEQAQVL ETYRMFANSK GWMRRMEEDI ARGLSAEAAV EKEQSLARGR MAQVQDAYLR
ERLSDLDDLS NRLLRILTGQ GKDTGAELPE NPILVARNIG PAELLEYGRS LKGIILEEGS
VGSHAAIVAR ALAIPLVVRV ARITTEGLNG DHVLVDGDNA VVHLRPDDPV VSAFRDKIAM
QAKAQERYSS IREKPAITVD GKRINLMMNA GLMADLPSLE NSGAEGVGLF RTELQFLVRN
QMPKRSELKG LYQRVLEAAG GKRVVFRTLD IGSDKVLPYM KPTDEPNPAL GWRAIRVGLD
KPGVMRMQLQ ALIRAANGGP LTVMFPFVAQ FEEYRLAREE VNKTIERERR LGHVLPSTLE
IGAMLETPSL GFAPLKFFEE VDFLSIGGND LKQFFFAADR ENERVRRRYD TLNVSFLSFI
ERIVERCDIS GTPLSFCGED AGRPIEAVCL AAMGIRTLSM RPASIGPVKS LLMRVDLGEL
RKIISDARHR GDQTVRPAVM QYLREL