Gene Information |
Locus tag | Cagg_2895 |
Symbol | |
ID | 7268767 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 3553909 |
End bp | 3556368 |
Gene Length | 2460 bp |
Protein Length | 819 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643567717 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_002464192 |
Protein GI | 219849759 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria); [COG1925] Phosphotransferase system, HPr-related proteins; [COG4668] Mannitol/fructose-specific phosphotransferase system, IIA domain |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family; [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
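The coordinate and length fields in the Gene Information table can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the stop codon is counted in Gene Length but not in Protein Length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3553909
end_bp = 3556368
gene_length_bp = 2460
protein_length_aa = 819

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons. The last codon is
# assumed to be the stop codon, which adds no residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length consistent:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks hold for this record: 3556368 - 3553909 + 1 = 2460, and 2460 / 3 - 1 = 819.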
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00460973 |
Fosmid hitchhiking | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCTACAGC TCACGGCTGC GAGCGTTCGC CTCGGCGCAA GAGCCGCCTC GAAGGAAGAG GCGATCCGAC AAGTCGGGCA GGTGTTAGTC CAAGCCGGCC ATATCAAGCC TGCTTATATT GACAGCATGC TAGCTCGCGA GGCCTTGGCC AATACCTTCC TCGGCAACGG GATTGCTATT CCGCACGGTA AACCAGAAGA CCGCGATTTG ATCCTCGAAA CCGGTATTGC AGTGTTGCAA GTTCCAGAGG GCGTACCGTG GAATACCAAC GAAACGGCTC GTCTGATTAT TGGGATTGCC GCACGGTCTG ATGAACATAT CGATGTGTTA CGCCGTCTCA CCCGCGTGCT CGGTGATGCG GCACTCGTCA CCCGGCTTTG TCAGACTCGC GATCCAGCCG ATATTGTCGA AGTCTTGACC GGCGAACGAC CTACATCCAT AGCACAACCG GCGACCGATT ACGATCATGC CGTACAGGTG GTGATCCACA ATCCTACCGG TCTCCACGCT CGCCCGGCCA CTGCTTTCGT GGAGACGGCC AAGAAGTTTC AGGCGGCGAT CCGAGTGCGC TACGGTGATG CAGTTGCCGA CGGTAAGAGC TTGCTTGGTC TGCTGCAATT GGGCGTCACT GCGGGGGCGA TGGTGACAAT ATCGGCCCAA GGGCCAGATG CCGATGCTGC ACTCGTGGCA TTGCAGGCGC TGGTGGCTGC CGGTATGGGT GAAGAGCCGT CCGAAGCGGT TGCACCGCGT GTACAGGTGG CGCAACGCGA TTGGAAGCCG CAGCACGTCG CTGCAACCAT CGAGGGTATC CCGGCGGCTG AGGGGTTGGC GGTGGGTCCG ATTCGCCATT ATCGACGGGT ACCGTTGGTA GTTACCGATA AACCCGGTGA CCGAATGATC GAGGCGGCTG CGCTCGAACA AGCGCTTGTT GCAGCCCGTA ATGAGCTAGC GGTGGTCGCC GATGAAGTGG CACGCCGGTT GGGATCATCG CAGGCAGCCA TCTTTCGTGC CCATACCGAG TTGCTTGCCG ATCCAGGATT GGTGCGTGAG ACGGTGAGTC GGATTTTTGA TGGTCATAGC GCGGCGTGGG CATGGCAGCA GACGATTGCG GCCCGAGTGG CCCAGCTCGC CAGGCTCGAT GACCCGGTGT TAGCCGGGCG GGCGGTTGAT CTGAGTGATG CCGGCCAACG GGTTTTGCGG CATCTGTTGG GTCTCGGTGA GATACCGCAT CTTAGTCTGG CCGAGCCGGC GATTATTGTG GCCGATGATC TTACACCCTC TGATACGGCC TCCCTCGATC CCGATAGGGT TTTGGGCTTG TGTACCGCAC ATGGCGGCCC GACATCGCAT ACCGCGATCA TTGCCCGTTC GCTTGGTCTG CCGGCCATCG TTGCTGCCGG CGAGGCGGTG CTCGATGTGC CTGAAGGTAC ACCGGCCATT CTCGACGGGT TCAACGGTGC GTTCTATCTA CGACCCTCGG CTGCCGATAT CGAGACGGCA CGTGCATTGC GGGCCGGCCT TGATCAGGCA CAAGCTATCG CGTTTGCTGT GCGGCATCAA CCGGCCATAA CTCGTGACGG TGTCCATATT GAGGTCGCTG CTAACGTGAA TCGGGTGGCA GATGCTGCAC GTGCTATCCA GAATGGTGCT GATGGGGTTG GCCTGATGCG TACCGAGTTT CTCTTTCTCG AACGCGATAG CGCTCCTGAT GAGGATGAGC AATACCAGGC TTATCGAACA ATGGTCGAGA CCATGGCCGG TCGGACGTTG ATTATTCGTA CTCTCGACAT CGGCGGCGAT 
AAAGAAGTAC CGTATCTCAA TATTCCACGT GAAGACAACT CATTCCTCGG TATTCGTGGC TTACGGTTGT GTCTGCGTCG CCCAGAACTG TTTGAGCCAC AGTTACGGGC GATCTTCCGG GCCGCCAAGC ATGGTCCGCT CAAAATCATG TTCCCGATGG TCGCGACCTT GGAAGAGGTG CGACAAGCCA AAGCAATTGC CGAGCGTATC CGTGCTGAAC TGAACGCGCC GCCGGTTGAG ATCGGGATTA TGGTTGAGGT GCCATCGGCG GCGATGCTGG CCGATGTGCT GGCTGCCGAG GTCGATTTCT TCTCAATCGG CACCAACGAT CTGACCCAGT ATGTGCTGGC GATGGATCGG CTGCATCCCG AATTGGCCCG CCAAGCCGAT AGCCTGCATC CAGCGGTGCT GCGCATGATT GCCCGCACTG TCGAAGGGGC GGCGAGCGCC GGACGTTGGG TAGGAGTGTG CGGCGGCATC GCCAGCGATC CGTTCGGCGC AGCGATTTTG GTTGGTTTAG GCGTGCATGA GTTGAGCGTC AGCATTCCCA GTGTCGCGAC CATTAAAGCG CATCTGCGCG GCCTCAGCGT CGCCGAGCTG CGCGAGCTGG CGTGGCGGGC GCTGGCGTGT CGCAGTGCAG CGGAGGTGCG TGCGCTATGA
|
Protein sequence | MLQLTAASVR LGARAASKEE AIRQVGQVLV QAGHIKPAYI DSMLAREALA NTFLGNGIAI PHGKPEDRDL ILETGIAVLQ VPEGVPWNTN ETARLIIGIA ARSDEHIDVL RRLTRVLGDA ALVTRLCQTR DPADIVEVLT GERPTSIAQP ATDYDHAVQV VIHNPTGLHA RPATAFVETA KKFQAAIRVR YGDAVADGKS LLGLLQLGVT AGAMVTISAQ GPDADAALVA LQALVAAGMG EEPSEAVAPR VQVAQRDWKP QHVAATIEGI PAAEGLAVGP IRHYRRVPLV VTDKPGDRMI EAAALEQALV AARNELAVVA DEVARRLGSS QAAIFRAHTE LLADPGLVRE TVSRIFDGHS AAWAWQQTIA ARVAQLARLD DPVLAGRAVD LSDAGQRVLR HLLGLGEIPH LSLAEPAIIV ADDLTPSDTA SLDPDRVLGL CTAHGGPTSH TAIIARSLGL PAIVAAGEAV LDVPEGTPAI LDGFNGAFYL RPSAADIETA RALRAGLDQA QAIAFAVRHQ PAITRDGVHI EVAANVNRVA DAARAIQNGA DGVGLMRTEF LFLERDSAPD EDEQYQAYRT MVETMAGRTL IIRTLDIGGD KEVPYLNIPR EDNSFLGIRG LRLCLRRPEL FEPQLRAIFR AAKHGPLKIM FPMVATLEEV RQAKAIAERI RAELNAPPVE IGIMVEVPSA AMLADVLAAE VDFFSIGTND LTQYVLAMDR LHPELARQAD SLHPAVLRMI ARTVEGAASA GRWVGVCGGI ASDPFGAAIL VGLGVHELSV SIPSVATIKA HLRGLSVAEL RELAWRALAC RSAAEVRAL
|
| |
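The gene and protein sequences are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing and translate nucleotide blocks, using the first three and last three blocks of the gene sequence as input. It assumes the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in the start codons they allow). The helper names `clean` and `translate` are hypothetical and not part of any database tool.

```python
from itertools import product

# Standard codon assignments, with codons enumerated in T, C, A, G order.
# Assumption: these assignments apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0; a stop codon is rendered as '*'."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last three blocks of the gene sequence in this record.
first_blocks = "ATGCTACAGC TCACGGCTGC GAGCGTTCGC"
last_blocks = "CGCAGTGCAG CGGAGGTGCG TGCGCTATGA"

print(translate(first_blocks))  # MLQLTAASVR
print(translate(last_blocks))   # RSAAEVRAL*
```

The two outputs agree with the first block (MLQLTAASVR) and the final residues (RSAAEVRAL) of the listed protein sequence, with the trailing `*` marking the TGA stop codon. The last 30 bases are in frame because the 2430 bases before them are a multiple of three.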