Gene Information |
Locus tag | Rcas_4116 |
Symbol | |
ID | 5541627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5327651 |
End bp | 5329321 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640896228 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001434166 |
Protein GI | 156744037 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) |
TIGRFAM ID | [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
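
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the gene sequence in this record does end in TGA). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and contributes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Rcas_4116.
length = gene_length_bp(5327651, 5329321)
print(length)                     # 1671
print(protein_length_aa(length))  # 556
```

Under these assumptions the computed values agree with the Gene Length (1671 bp) and Protein Length (556 aa) fields.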
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.023683 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCATCT ATCTTCGAGG CGCTGGGAGT TCACCGGGAG TGGCGCTTGG GCGTGCGGTG CGCTACCTCC CCGATAGCCA CGCCTGGCAT GCGGTTGATG CCGACATCGA TGCCGCAATG GCGCGTTTCA CAGCAGCTCA GGCAATGGCT GCCACGCAAA TGCGCACGCT GGCAGAGTTG TTGCGTGAAG AAGGACGCAT CGAAGAGGCG CGCATTTTTG ACACCCATGC GCTCCTGGTT GAAGATGAAA TCCTGACGCA GGACGTAGAA CGGCGTATGC GCGCGGGGCG CATCAGTCTG GAGCAGGCGC TGATCGCCGC CATCGATTCG CTGCGCGACG CCGTCGATGC CATCGACGAC CCCTATCTAC GCGAACGTTC CAGCGACATC GACAGCGTGC GGCGCGCTAT TCTGACGGCG CTGCACGGCG AAACCCGCCG CATTCGCGAT CTGCCGATCG GCGCCATTCT GGTGGCGAAT GACCTGACGC CAGCGGAAGC GGTCAGCCTG CGCGATGGAC GGATCGCCGG ATTCGCAACT GCCGAGGGTG GACCGACCAG CCATACGACG ATCCTGGCGC GCGCCTTTGG CATCCCGGCG GTTGTCGGGT TGGGCGCAGC AACGCTGGCG GTTCCCGATG GCGCGCCACT GGTGCTCGAC GGATACACGG GACTGCTGAT CGTCGATCCT GACGCCTTTG AATGGTCCTC CTACGAACGT CGCGCGTCCG CGCTGGTAAC GGCGCCGGTT CGGCGACAAC CGTCACGCGA TCAACCGGGG CGCCTGGCAA GCGGCGAGCC GGTGACCATC TGGGCAAATA TCAACCATCC GCTCGAGGCG CGTATCGCCC TCGAACAGGG AGCAGAAGGC ATCGGACTGT TTCGCACCGA GTTTCTCTTC CTGGGGCGTA GCACTCCGCC CGACGAGAAC GAACAGTACG AGGCATATCG CGCCGTAGTC GAGATGATGG AAGGGCGCCC GGTCATTATC CGCACCCTGG ACATCGGCGG CGATAAGCGA GTGGAGTACC TCGACCTGCC GCACGAACCC AATCCCTCAC TCGGTATCCG TGGGCTGCGC CTGGCAATGC GTCGGCCCGA TCTCTTCCAG ACACAGATTC GCGCTATGCT TCGCGCTGCA ACGCACGGCG ATCTGCGTAT CCTGTTGCCG ATGGTCGCCA TACCGGACGA AGTGACATGG GCACGTGAAC AGATCCACAG CGCCGCCGAG TCGCTGGCAC GTCAGGGCAT TCCTCACCGT GCTGACGTGC CTGTTGGCGT CATGATCGAA ACGCCAGCCG CAGCAATCAC TGCCGATCTG CTGGCGCGCG AGGCGGCGTT CTTCAGCATT GGCACCAACG ACCTGGCGCA GTACGCGCTT GCTGCCGACC GCACGAGCGC CGATGTGTCT GCCCGATACT CCCAGACATC CGCTGCTATC CTGCGCCTGA TTGCGCAGAC CGTCGGCTCT GCCATTCGCG CTCGTTTGCC GGTGTGTGTC TGCGGCGAGA GCGCCGGTGC GCCGGATGTG GCGCCGCTCC TGATCGGGTT GGGCGTGTCA CAATTGAGCA TGAACCCGGC GAGCATTTCC ATTGTCAAAG AGCGTCTGAG CGAGACGATG ATGACGCAGG CGCAGGCGGC GGCGCACGCA GTGTTGAACA TTTACATATG A
|
Protein sequence | MAIYLRGAGS SPGVALGRAV RYLPDSHAWH AVDADIDAAM ARFTAAQAMA ATQMRTLAEL LREEGRIEEA RIFDTHALLV EDEILTQDVE RRMRAGRISL EQALIAAIDS LRDAVDAIDD PYLRERSSDI DSVRRAILTA LHGETRRIRD LPIGAILVAN DLTPAEAVSL RDGRIAGFAT AEGGPTSHTT ILARAFGIPA VVGLGAATLA VPDGAPLVLD GYTGLLIVDP DAFEWSSYER RASALVTAPV RRQPSRDQPG RLASGEPVTI WANINHPLEA RIALEQGAEG IGLFRTEFLF LGRSTPPDEN EQYEAYRAVV EMMEGRPVII RTLDIGGDKR VEYLDLPHEP NPSLGIRGLR LAMRRPDLFQ TQIRAMLRAA THGDLRILLP MVAIPDEVTW AREQIHSAAE SLARQGIPHR ADVPVGVMIE TPAAAITADL LAREAAFFSI GTNDLAQYAL AADRTSADVS ARYSQTSAAI LRLIAQTVGS AIRARLPVCV CGESAGAPDV APLLIGLGVS QLSMNPASIS IVKERLSETM MTQAQAAAHA VLNIYI
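
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be rejoined before any analysis. The following is a minimal sketch, assuming plain uppercase or lowercase nucleotide letters with no ambiguity codes; the helper names are hypothetical. The stop codon set used is that of translation table 11 (TAA, TAG, TGA).

```python
# Stop codons in NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(grouped: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence, used as a small example.
example = clean_sequence("ATGGCCATCT ATCTTCGAGG")
print(len(example))          # 20
print(gc_fraction(example))  # 0.5
```

Applying `clean_sequence` to the full Gene sequence field and passing the result to `gc_fraction` and `ends_with_stop` gives a way to compare against the GC content and Gene Length values reported in the Gene Information table.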
|
| |