Gene Cagg_2007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2007 
Symbol 
ID7269165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2453052 
End bp2454722 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content60% 
IMG OID643566841 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_002463331 
Protein GI219848898 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) 
TIGRFAM ID[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAGA TGCTGCAAGG TGCCGGTGGT GCTGCTGGGT TGGCGTTGGG TCCTGCGTAC 
CGCTGGCAGC GTGTGGCTAC GGTAGCAGTT GATCCGCCTC ACGAGTCGGT AGAAGCGGCA
TTGGCTCGCT TTCATGCAGC ACAACGTGCC GCCGCAGCAC GCTTGCGTGC TATCGCCGAA
CGGCAACGAG CAGCCGGTTT GCACGAAGCC GATTTGTTCG ATGCACAAGC TTTGCTGGTC
GAAGATGAGA CCTTGACCGA TGGGGTGACG GCGCTGGTGC TTGATGGGCA GCCGTTGACG
ACGGCAATTC GCACAACAGT GGCGCAAATG CAGGCGTTGC TCGCCGATCT TGATGATGAG
TATCTGCGGG AACGTGCGGC TGATATGGCC GCGGTTGGGG TGGAGTTGTT GCATGCGCTG
GCCGGCGAGA CCGCATCGCA GCCAACTGTC CCGCCCGACG CTATTGTCGT TGCTGATGAT
TTGACGCCCG CCGAAACGGT CGACCTACCG CACCACGTTG CCGGTTTTGC TACTGCCGAT
GGTGGTCCGA CCGGTCATAC TGTTATTCTT GCCCGCGCAC GAGGTGTGCC GGCAGTAGTA
GGGGTAGGTG ACGAAATCCT CGCTGTGCCC GATGGCGTAC AGCTTTTGAT CGATGGCGAT
GCCGCGACGG TATTGATCGA CCCCGATGAA GCAGCGTTGC AGTCGGCTCA AGTGCGGATG
GAGGCGTTGC GAGTGCTGCA ACGGCGACAA GCGGCGTTGC GGGATCAGCC CGGTCAGTTG
CGTGATGGCC GCTTAGTTGG ATTGTGGGCC AACATTGGTC GTCCGGCTGA GGCGCGATTG
GCCCGTGAGT ACGGGGCCGA AGGTATCGGA TTATTTCGGA CGGAGTTTCT TTTTCTCGAC
CGTTCGGCGC CACCAGATGA AGATGAGCAG TATACGGCAT ATTGCGCGGT GTTGGATGAG
TTGCCCGGCA AGCCGGTAGT GATCCGCACA CTCGATATCG GTGGTGATAA GCCGTTGCCG
TATCTCCCAC TTTCCCCTGA AGCGAACCCG TTTCTTGGGG TGCGAGGGTT GCGGCTCTCG
ATGCAGCGCC CCGATCTCTT CCAAATCCAA TTGCGTGCGT TGTTACGGGC AGCGTTTCGT
GGCGATATTT GGATTATGCT ACCGATGGTT GCCACTCCAG CCGATCTCGC GTGGGCGCGT
GCGCAGTTGG TGGAAGCGGC GGCAGCCTTG GCAGCGGCCG GTGTTGATCA TCGGCCCGAT
CCACCACTGG GTGTGATGAT CGAGACGCCG GCGGCGGCGG TGTTAGCCGA TCAACTGGCA
CGAGACGCGG CCTTCTTTAG CATTGGGAGC AACGATTTGG CTCAGTATAC ACTGGCGGTT
GATCGTGGTC ATCCTACCCT GGCGGCTCGT TATCCCTCCA ATGATTCGTC GGTCTGGCGG
ATGATCGATC TGGCTGCGCG TGCTGCACAG CAGGCCGGTA TTCCGATTGG TATTTGTGGT
GAGCTTGGTG GTGAACCAGA TGCCGCTCCA GCTCTGGTGG GCTTAGGCTT GCACGAGTTG
AGTATGGCCC CGGCTCGTAT TCCGGCAGTC AAGGAACGAC TGCTGCAAAC CTCATGGGCT
GAAGCACAAG CGGCTGCGGC GCGGGCGCTT GCGGGGTGGC GAGAAGCATA A
 
Protein sequence
MGKMLQGAGG AAGLALGPAY RWQRVATVAV DPPHESVEAA LARFHAAQRA AAARLRAIAE 
RQRAAGLHEA DLFDAQALLV EDETLTDGVT ALVLDGQPLT TAIRTTVAQM QALLADLDDE
YLRERAADMA AVGVELLHAL AGETASQPTV PPDAIVVADD LTPAETVDLP HHVAGFATAD
GGPTGHTVIL ARARGVPAVV GVGDEILAVP DGVQLLIDGD AATVLIDPDE AALQSAQVRM
EALRVLQRRQ AALRDQPGQL RDGRLVGLWA NIGRPAEARL AREYGAEGIG LFRTEFLFLD
RSAPPDEDEQ YTAYCAVLDE LPGKPVVIRT LDIGGDKPLP YLPLSPEANP FLGVRGLRLS
MQRPDLFQIQ LRALLRAAFR GDIWIMLPMV ATPADLAWAR AQLVEAAAAL AAAGVDHRPD
PPLGVMIETP AAAVLADQLA RDAAFFSIGS NDLAQYTLAV DRGHPTLAAR YPSNDSSVWR
MIDLAARAAQ QAGIPIGICG ELGGEPDAAP ALVGLGLHEL SMAPARIPAV KERLLQTSWA
EAQAAAARAL AGWREA