Gene Rcas_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2048 
Symbol 
ID5539526 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2624253 
End bp2626769 
Gene Length2517 bp 
Protein Length838 aa 
Translation table11 
GC content61% 
IMG OID640894182 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_001432153 
Protein GI156742024 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins
[COG3412] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase
[TIGR02364] dihydroxyacetone kinase, phosphotransfer subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.488537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAAGTA TTGTGCTCGT TTCGCACAGT TCGTTGCTGG CGGCCGGCAT CGTCGAAATG 
GCGCGCATGG TTATGCAACA GGCGCCGGTG GCGATTGCTG TGGCGGCTGG CGCCGATGAT
CCAGGACATC CATTGGGGAC CGATGCTGCG AAGATTCGTC AGGCGATCGA AGAGGTGTAT
AGCGACGACG GCGTGCTGGT GCTGATGGAT CTGGGCAGCG CGGTGTTGAG CGCAGAAATG
GCGGTTGATT TTCTTCCTGA ACACAAGCGC GCCAATGTGA GATTATGCGC TGCGCCGATT
GTCGAAGGAA CGATTGCCGC AGTGGTGCAG GCTAGCCTGG GCGCATCGCT GGATCGCGTC
GCCGCCGAGG CGCTGGAGGC GCTGGCGGGA AAAGTCGAGA GCCTGAGCGA TCAGGGGCAG
GCATCCGGCG CTGCGGCGCC ATCCCCATCG ACCGACGCAA CCGCCGAGGT GCTGCACGCA
CAACTGACGG TGACGAACCG CCTGGGGTTG CACGCCCGAC CGGCGGCATT ATTGGTGCAG
ACGGCAGGGC GCTTTTGTGC CGATGTTCGT CTCGCGCGCG TTGGGCAGGA AACTCGTCAG
GTCAATGCAA AGAGTTTCAA TGCTGTCGCA TCGCTGGGCA TCCGCCAGCA TGAGATGATC
ACCGTTTCGG CGCGCGGACC GGACGCCGCC GAGGCGCTTG CGGCATTGCA ACAACTCGCT
GCGGATCAGT TCGGCGAAGC CGACGAACTG CTGGAAGCGC AGCCATCAGC GCCACCGTCA
CCGATGGCAG AAGCGCTCAC CGGCGCGTTG CGTGGTGCTG CCGCCTCCCC AGGGTATGCC
ATCGGTCCGG CAGTGGTGCT GCGTCAGGTT GAACCACAGA TCGAACGGCG CATCATCAGT
GATCCAGATA CTGAAATGTC TCGTTTACAG GCGGTTTTGG ACGCAGTTCG CGAATCGACG
CGCGTGCTGC GCGACCAGAT TGCGCGGCAG CATCCCTACG AGGCGGCGAT CTTTGATGCG
TATCTGATGT TTTTGACCGA TCCCGATATT CTGGCGCGGG TGCGACAGAT TATCGCACGC
GACCGCGTTT GCGCCGAATG GGCATGGCGC GAGGCGGTGA ATGAATCTGC CAGGGCATTC
GAGTCTATCG AGGATGAATA CATGCGGGCG CGGGCAGTCG ATATTCGCGA CATTGGCAGG
CAGGTGTTGA GCCGTCTGAC CGGGCAGACT CGATCATTCA GTCTGGATCG GTCGGGTATC
GTGATTGCGT CCGATCTTTC ACCATCCGAT ACGGCGCACC TTGATCGGTC AATGGTGTTG
GGCATCTGCA CAGAACGGGG TAGCCCGACC TCGCACAGCG CCATTCTGGC GCGTACCCTC
GGCATCCCCG CCGTTGTGGG AGTAGGCGCC GCCATCACGC AGGTTGCTCC TGGTACGCCG
CTGGTGATTG ATGGGTATGA GGGGTTGGTC TGGATCGCGC CCGATGAGTC GATTGTTGTG
GCATATGCCA ATCGGGAAGC GCAATGGCGG GCAACGCAGG AACAGGCGCG ACAATCGAGC
ACCGCACCGG CCGTGACGAA GGACGGCATG CACATCGAGA TTGCCGCCAA TATCGGTAGC
CTGGCGGATG CGCGCGTCGC CGTTGAGAAC GGCGCCGATG GCGTGGGACT GCTCCGTACA
GAATTCCTCT TTCTTGATCG CACGGCGGCG CCTGATGAAA ACGAGCAGTA TGAGGTGTAC
GCTGCGATTG CGCGGGTGAT GGGGGAGCGT CCGGTGGTCG TGCGCACCCT CGATGTGGGA
GGTGACAAGC CGCTTGCGTA CATTTCGCTG GAGCGTGAAG ACAACCCGTT TCTTGGCCAA
CGCGCCATCC GGCTTTGCCT GAATCAACCA GATCTCTTTG CGACTCAACT GCGCGCTATT
TTGCGCGCGA GCGCCGGACA TCGACTCAAG GTGATGTTTC CAATGATTGC GGATATCGGC
GAGTTGCGCC GCGCACGTGC GGTCCTGGAG TCGGTGCTTG CCGGGTTGCA CACACAATCT
GTGCCGGTGG CGGATGCTGT CGAGGTTGGG ATAATGGTCG AGGTGCCCTC AGCCGCATTG
CTCGCACACG TCTTTGCGCC GGAAGTCGAT TTTTTCAGCA TTGGCTCGAA CGATCTGGTG
CAGTATACGC TGGCAGCAGA ACGGGGCAAT GCGGCGGTTG CGCATTTGCA GGACGGTCTG
CATCCGGCGG TGTTGATGCA AATCCAGCGC GTGGTTCAAA GCGCGCAACA TGCCGGGAAA
TGGGTGAGCG TCTGCGGCGA ACTGGCTGCC GATCATGATG CTGTGCCGGT CCTGATCGGA
TTAGGCGTGC AGAAACTGAG CATGGCGCCT GGCGCCATTC CGCACATCAA GGCGCTTATT
CGGCGACTGA CGCTGCAAGA AGCGCGGCAG TGGGCGAGTC AGGCGCTGGC AATGGAGTCG
GCGGAAACAG TGCGTCGGTT TATCCGTTCG CGGTTGGAGG CGCTTGTTGG TGAATAG
 
Protein sequence
MVSIVLVSHS SLLAAGIVEM ARMVMQQAPV AIAVAAGADD PGHPLGTDAA KIRQAIEEVY 
SDDGVLVLMD LGSAVLSAEM AVDFLPEHKR ANVRLCAAPI VEGTIAAVVQ ASLGASLDRV
AAEALEALAG KVESLSDQGQ ASGAAAPSPS TDATAEVLHA QLTVTNRLGL HARPAALLVQ
TAGRFCADVR LARVGQETRQ VNAKSFNAVA SLGIRQHEMI TVSARGPDAA EALAALQQLA
ADQFGEADEL LEAQPSAPPS PMAEALTGAL RGAAASPGYA IGPAVVLRQV EPQIERRIIS
DPDTEMSRLQ AVLDAVREST RVLRDQIARQ HPYEAAIFDA YLMFLTDPDI LARVRQIIAR
DRVCAEWAWR EAVNESARAF ESIEDEYMRA RAVDIRDIGR QVLSRLTGQT RSFSLDRSGI
VIASDLSPSD TAHLDRSMVL GICTERGSPT SHSAILARTL GIPAVVGVGA AITQVAPGTP
LVIDGYEGLV WIAPDESIVV AYANREAQWR ATQEQARQSS TAPAVTKDGM HIEIAANIGS
LADARVAVEN GADGVGLLRT EFLFLDRTAA PDENEQYEVY AAIARVMGER PVVVRTLDVG
GDKPLAYISL EREDNPFLGQ RAIRLCLNQP DLFATQLRAI LRASAGHRLK VMFPMIADIG
ELRRARAVLE SVLAGLHTQS VPVADAVEVG IMVEVPSAAL LAHVFAPEVD FFSIGSNDLV
QYTLAAERGN AAVAHLQDGL HPAVLMQIQR VVQSAQHAGK WVSVCGELAA DHDAVPVLIG
LGVQKLSMAP GAIPHIKALI RRLTLQEARQ WASQALAMES AETVRRFIRS RLEALVGE