Gene Sala_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2149 
Symbol 
ID4080146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2258079 
End bp2260376 
Gene Length2298 bp 
Protein Length765 aa 
Translation table11 
GC content67% 
IMG OID638010527 
Productphosphoenolpyruvate-protein phosphotransferase PtsP 
Protein accessionYP_617191 
Protein GI103487630 
COG category[T] Signal transduction mechanisms 
COG ID[COG3605] Signal transduction protein containing GAF and PtsI domains 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACG CCCCGCCCCC GCCCTCGACG GCCGCCCAGT CGGCGCGCAC CATATTGACC 
CGGCTGCACG AGGTCATGGC GTCGCGTGCC AATGCGCAGG GCAAGCTCAA TCAGGTCGTC
GGGATCATCG GCGAATGCCT CGACAGCGAA GTCTGCTCGA TCTACCTGCT GCGCGACGGC
GCGCTCGAAC TTTATGCGAC GCGCGGCCTC AAGCAGGAGG CGGTGCATGT CACACGCCTT
GCGCCCGGCG AAGGGCTTGT GGGCACCATT GCCGAACATA TCGAAACGCT CAATCTCGAC
GAAGCCGCAG CGCACCCCGA CTTTTCCTAT CGCCCCGAAA CGGGCGAGGA ATTGTTCCAC
AGCTTTGCCG GGGTGCCGAT CATCCGTCGC GAGCGCGCCG TTGGCGTCCT TTGCGTCCAG
CACGCGGAAC CGCGCCGTTA TGAAGAGATC GAGATCGAGA CGCTGCAAAC GGTCGCGATG
GTGCTGTCCG AACTGATCGC CAACGCCGAC CTTGTCGATA CCGCCGCACG CACCGACGCC
GCGGCCGCCG ACCAGTCGGC GCAGCGGCTG AACGGACAGA AGCTGGTCGA CGGCATGGGC
GCCGGGGTCG CGGTATTCCA TCAGCCGCGC ATCACCATCG AACATACGGT CGCCGACGAT
GTCGAGGCCG AACGCCACCG CGTCTACGCC GCCTTCGACA AGATGCGCGA ACAGATCGAC
CGCATGGCAA GCCAGGCCGA ATTCGGCGTC GGCGGCGAGC ATGAAGAGGT GCTCGAAACC
TACAAGATGT TCGCCTATGA CGAAGGCTGG TCGCGGCGGA TCAACGAAGC GATCGACAGC
GGCCTGACCG CAGAGGCGGC GATCGAGCGC GTCCAGCAAC GCACGCGGAT GCGGATGCGC
CAGATCGACG ATCCGCTGCT CCGCGACCGG ATGCACGACC TCGAAGATCT GTCGAACCGG
CTGATCCGCA TCGTGTCCGG ACAGATGGGC ACCGCAGCGC AAATGGGGCT CCGGCAGGAT
TCGATCCTGA TCGCACGCAA CCTCGGTCCG GCCGAACTGC TCGAATATGA TCGCCGCCGC
CTGAAAGGCG TCGTGCTCGA AGAAGGCTCG CTGACCGCCC ATGTCATCAT CGTTGCGCGC
GCGATGGGCG TCCCGGTCAT CGGCCGCGTC CGCGACGTGC GCACCTCGAT CCGCGAGGGC
GACCTGCTGC TGCTCGATGC CAGCGCGGGC ACTGTGCATG TCCGCCCGAC GCCCGCGGTG
CAGGAGGCTT TCAACGCCAA GCTCGCCATC TCGCAAAAGC GCCGCGCCGA TCTTGCGGCG
CTGCGCGACC TTCCTGCGGT CACCAGGGAT GGGGTTCCGA TCGAGTTGAT GATCAACGCC
GGCCTGCGCG AGGATGTCGC TGCGCTCGAC CTCACCGGCG CGCGCGGGAT CGGGCTGTTC
CGCACCGAGT TCCAGTTCCT CGTCTCGGCG ACGCTGCCCG CGCGCGAACG GCAGCAGCGG
CTGTATCGCG ACGTGCTCGA TGCCGCGGGC GACCGCCCGG TCATCTTCCG CACCGTCGAC
ATCGGCGGCG ACAAGGCGCT GCCCTATATG AATGTTGGCG AAGGCGCGCA GGAGGAAAAC
CCGGCGATGG GCTGGCGCGC GCTCCGTCTC GCGCTCGACC GCGAAGGTCT TTTGAAGGTG
CAGGCGCGCG CGCTGATGGA AGCCGCGGCC GGGCGTACGC TCAACGTCAT GTTCCCGATG
GTGTCGGAAC CATGGGAATA TGAGGCGGCG CGCGCCTTGT TCGTCGGCCA GCGTGCCTGG
CTCGCCAGCC ACAACAAGAA GCTGCCGGTG GCGATCCGCT ATGGCGCGAT GCTCGAGGTG
CCCGGGCTGG TCGAAACGCT CGACCTGATG CTGCCGCACC TCGACTTCCT GTCGATCGGC
ACCAACGACC TCACCCAGTT CCTCTTCGCC GCCGACCGGG CGCACCCGCG GCTCGCCGAA
CGCTACGACT GGCTGTCGCC GACGGTGATG CGCTATCTGG CGCGCGTGGT GAAGATCGTG
TCGGGATCGA AGGTCGCGCT CGGCGTGTGC GGCGAAATGG GGGGACGCCC CCTGGAAGCC
ATGGCGCTGC TCGGTCTCGG CATCGAACGC CTGTCGATCA CCCCCGCGGG CGTCGGCCCG
GTCAAGGCGA TGATCCGCTC GCTCGACCTC GGCGCGCTGC GCGCCGACAT GCCCGCGATG
CTGGCGCAGC CGGCGCCGGA CCCGCGCGGG CAGTATGAGC AATGGGCGCT CGCCCATCAG
GTCGACCTGG GCACGTAA
 
Protein sequence
MTNAPPPPST AAQSARTILT RLHEVMASRA NAQGKLNQVV GIIGECLDSE VCSIYLLRDG 
ALELYATRGL KQEAVHVTRL APGEGLVGTI AEHIETLNLD EAAAHPDFSY RPETGEELFH
SFAGVPIIRR ERAVGVLCVQ HAEPRRYEEI EIETLQTVAM VLSELIANAD LVDTAARTDA
AAADQSAQRL NGQKLVDGMG AGVAVFHQPR ITIEHTVADD VEAERHRVYA AFDKMREQID
RMASQAEFGV GGEHEEVLET YKMFAYDEGW SRRINEAIDS GLTAEAAIER VQQRTRMRMR
QIDDPLLRDR MHDLEDLSNR LIRIVSGQMG TAAQMGLRQD SILIARNLGP AELLEYDRRR
LKGVVLEEGS LTAHVIIVAR AMGVPVIGRV RDVRTSIREG DLLLLDASAG TVHVRPTPAV
QEAFNAKLAI SQKRRADLAA LRDLPAVTRD GVPIELMINA GLREDVAALD LTGARGIGLF
RTEFQFLVSA TLPARERQQR LYRDVLDAAG DRPVIFRTVD IGGDKALPYM NVGEGAQEEN
PAMGWRALRL ALDREGLLKV QARALMEAAA GRTLNVMFPM VSEPWEYEAA RALFVGQRAW
LASHNKKLPV AIRYGAMLEV PGLVETLDLM LPHLDFLSIG TNDLTQFLFA ADRAHPRLAE
RYDWLSPTVM RYLARVVKIV SGSKVALGVC GEMGGRPLEA MALLGLGIER LSITPAGVGP
VKAMIRSLDL GALRADMPAM LAQPAPDPRG QYEQWALAHQ VDLGT