Gene Cagg_0278 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0278 
Symbol 
ID7267459 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp349590 
End bp351095 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content57% 
IMG OID643565148 
ProductPpx/GppA phosphatase 
Protein accessionYP_002461662 
Protein GI219847229 
COG category[F] Nucleotide transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0248] Exopolyphosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00854606 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.782575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC TTGGCGTGAT CGACCTCGGT TCGAATACGA CGCGGCTCAT TGTGATGGCG 
TATGAGCCGG GATATAGCTT TCGCCTCACC GATGAAGTCA GTGAAACGGT GCGCCTCGCG
GAGGGGATTG GGCCGTCTCG TATGCTGCAA CCGACACCCA TTCGCCGTGC CGTCGAAGCG
CTGCATATGT TCTACTCACT CTGTCAGTCG ACCGGTGTCG ATCAGGTGAT CGCCGTCGGA
ACGAGTGCCA TCCGCGAAGC CGCCAATCAA GCTGAGTTTT GGACGGCGTT GCGCGAGGTA
ACAGCGCTCG ACTTACGGAT TATCTCGGCA GAAGAGGAGG CATATTTTGG CTATTTGGGT
GCGATCAACG CTTTACCCCT TCATACCGGC GCCCTGTTCG ATACCGGTGG CGGTTCGACC
CAGGTCATGG CGGTGCGCAA TCGTGAACTC ACCCAGAGTT TTTCTGTGCA AGCCGGTGTC
GTCCGCTTTA CCGAGCAGTA CGTGCAGAGT GATCCGGTCA GCCGCGCCGA TCTCCGCCGG
TTGCGTGAGG CCGCGCACAC AGCATTTGCC CCGATCGATT GGATCGCCGA CCTCAGCGAC
GGTAAATTGG GCGGGATTGG AGGCACCGTG CGTCAACTGG CCCGGATTGA TCAAAAAATG
CGCCATTACC CACTTGAACG GGTGCATGGC TATGTCCTCT CCCGTGAGGC GATCGAACGC
ATCATTGAAG AATTGGCCCG TCGCTCGCGC CGCGATCGTT TGCAGATCCC GGGAATGAAA
GAGGAGCGGG TAGATATTAC CTTGGCCGGT GCCGTGATTA TTGCCACATT GATGGATCGC
GGAGGATTTT CTAGCCTGCT GGTGAGCGGA CAAGGTGTGC GCGAAGGTAT TTTCTATCAG
CACTTCCTCG CCGATCAACC GCAACCCTTG ATCGCCGATC CACGACGTTT CAGCGTGCTC
AATCTCGCTC ATCTCACCCA CTACGAACCA CAGCACTGCG AGCGTGTTGC CCACCTGAGT
CTGTCGTTGT TCGATCAGTT AGCACCCCTC CACGGCTATG GTGCCTGGGA ACGTGAGCTA
CTCGGCTATG CCGCCATTCT TCACGACATC GGTATCAGTG TTGGCTATTA CGACCATCAC
AAACACGGTG AGTACCTCAT CCACAACGCA ACGTTGCTCG GCTTCAGTCA TCGCGAGATC
GTGATTATCG CCAGTTTGGT ACGTAATCAT CGCAAGGGCT GGGGTGAATT AGCCCCCTAC
GCGCCCATCC TAACCCCAGA TGACGAAACC CGCATCGCGC GTCTGAGTGC GATGCTGCGC
CTGTCCGAAT ACCTTGAGCG GAGCAAAAGT CAGATCGTGC GCGATGTGCG CGTACAACTC
GGCGAAGAGA TCGTCATTGA CATTCTGGCC GATGGTGATG CGCACGTTGA AATCTGGGAA
GCCTCGCGCC GGAGTGGATT GCTCCGACGT AGCTTCGGGC GTGATGTGCA GATCAGAGCC
GCATGA
 
Protein sequence
MKKLGVIDLG SNTTRLIVMA YEPGYSFRLT DEVSETVRLA EGIGPSRMLQ PTPIRRAVEA 
LHMFYSLCQS TGVDQVIAVG TSAIREAANQ AEFWTALREV TALDLRIISA EEEAYFGYLG
AINALPLHTG ALFDTGGGST QVMAVRNREL TQSFSVQAGV VRFTEQYVQS DPVSRADLRR
LREAAHTAFA PIDWIADLSD GKLGGIGGTV RQLARIDQKM RHYPLERVHG YVLSREAIER
IIEELARRSR RDRLQIPGMK EERVDITLAG AVIIATLMDR GGFSSLLVSG QGVREGIFYQ
HFLADQPQPL IADPRRFSVL NLAHLTHYEP QHCERVAHLS LSLFDQLAPL HGYGAWEREL
LGYAAILHDI GISVGYYDHH KHGEYLIHNA TLLGFSHREI VIIASLVRNH RKGWGELAPY
APILTPDDET RIARLSAMLR LSEYLERSKS QIVRDVRVQL GEEIVIDILA DGDAHVEIWE
ASRRSGLLRR SFGRDVQIRA A