Gene Rcas_4116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4116 
Symbol 
ID5541627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5327651 
End bp5329321 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content63% 
IMG OID640896228 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_001434166 
Protein GI156744037 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.023683 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCATCT ATCTTCGAGG CGCTGGGAGT TCACCGGGAG TGGCGCTTGG GCGTGCGGTG 
CGCTACCTCC CCGATAGCCA CGCCTGGCAT GCGGTTGATG CCGACATCGA TGCCGCAATG
GCGCGTTTCA CAGCAGCTCA GGCAATGGCT GCCACGCAAA TGCGCACGCT GGCAGAGTTG
TTGCGTGAAG AAGGACGCAT CGAAGAGGCG CGCATTTTTG ACACCCATGC GCTCCTGGTT
GAAGATGAAA TCCTGACGCA GGACGTAGAA CGGCGTATGC GCGCGGGGCG CATCAGTCTG
GAGCAGGCGC TGATCGCCGC CATCGATTCG CTGCGCGACG CCGTCGATGC CATCGACGAC
CCCTATCTAC GCGAACGTTC CAGCGACATC GACAGCGTGC GGCGCGCTAT TCTGACGGCG
CTGCACGGCG AAACCCGCCG CATTCGCGAT CTGCCGATCG GCGCCATTCT GGTGGCGAAT
GACCTGACGC CAGCGGAAGC GGTCAGCCTG CGCGATGGAC GGATCGCCGG ATTCGCAACT
GCCGAGGGTG GACCGACCAG CCATACGACG ATCCTGGCGC GCGCCTTTGG CATCCCGGCG
GTTGTCGGGT TGGGCGCAGC AACGCTGGCG GTTCCCGATG GCGCGCCACT GGTGCTCGAC
GGATACACGG GACTGCTGAT CGTCGATCCT GACGCCTTTG AATGGTCCTC CTACGAACGT
CGCGCGTCCG CGCTGGTAAC GGCGCCGGTT CGGCGACAAC CGTCACGCGA TCAACCGGGG
CGCCTGGCAA GCGGCGAGCC GGTGACCATC TGGGCAAATA TCAACCATCC GCTCGAGGCG
CGTATCGCCC TCGAACAGGG AGCAGAAGGC ATCGGACTGT TTCGCACCGA GTTTCTCTTC
CTGGGGCGTA GCACTCCGCC CGACGAGAAC GAACAGTACG AGGCATATCG CGCCGTAGTC
GAGATGATGG AAGGGCGCCC GGTCATTATC CGCACCCTGG ACATCGGCGG CGATAAGCGA
GTGGAGTACC TCGACCTGCC GCACGAACCC AATCCCTCAC TCGGTATCCG TGGGCTGCGC
CTGGCAATGC GTCGGCCCGA TCTCTTCCAG ACACAGATTC GCGCTATGCT TCGCGCTGCA
ACGCACGGCG ATCTGCGTAT CCTGTTGCCG ATGGTCGCCA TACCGGACGA AGTGACATGG
GCACGTGAAC AGATCCACAG CGCCGCCGAG TCGCTGGCAC GTCAGGGCAT TCCTCACCGT
GCTGACGTGC CTGTTGGCGT CATGATCGAA ACGCCAGCCG CAGCAATCAC TGCCGATCTG
CTGGCGCGCG AGGCGGCGTT CTTCAGCATT GGCACCAACG ACCTGGCGCA GTACGCGCTT
GCTGCCGACC GCACGAGCGC CGATGTGTCT GCCCGATACT CCCAGACATC CGCTGCTATC
CTGCGCCTGA TTGCGCAGAC CGTCGGCTCT GCCATTCGCG CTCGTTTGCC GGTGTGTGTC
TGCGGCGAGA GCGCCGGTGC GCCGGATGTG GCGCCGCTCC TGATCGGGTT GGGCGTGTCA
CAATTGAGCA TGAACCCGGC GAGCATTTCC ATTGTCAAAG AGCGTCTGAG CGAGACGATG
ATGACGCAGG CGCAGGCGGC GGCGCACGCA GTGTTGAACA TTTACATATG A
 
Protein sequence
MAIYLRGAGS SPGVALGRAV RYLPDSHAWH AVDADIDAAM ARFTAAQAMA ATQMRTLAEL 
LREEGRIEEA RIFDTHALLV EDEILTQDVE RRMRAGRISL EQALIAAIDS LRDAVDAIDD
PYLRERSSDI DSVRRAILTA LHGETRRIRD LPIGAILVAN DLTPAEAVSL RDGRIAGFAT
AEGGPTSHTT ILARAFGIPA VVGLGAATLA VPDGAPLVLD GYTGLLIVDP DAFEWSSYER
RASALVTAPV RRQPSRDQPG RLASGEPVTI WANINHPLEA RIALEQGAEG IGLFRTEFLF
LGRSTPPDEN EQYEAYRAVV EMMEGRPVII RTLDIGGDKR VEYLDLPHEP NPSLGIRGLR
LAMRRPDLFQ TQIRAMLRAA THGDLRILLP MVAIPDEVTW AREQIHSAAE SLARQGIPHR
ADVPVGVMIE TPAAAITADL LAREAAFFSI GTNDLAQYAL AADRTSADVS ARYSQTSAAI
LRLIAQTVGS AIRARLPVCV CGESAGAPDV APLLIGLGVS QLSMNPASIS IVKERLSETM
MTQAQAAAHA VLNIYI