Gene Cagg_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1646 
Symbol 
ID7268948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2006569 
End bp2008881 
Gene Length2313 bp 
Protein Length770 aa 
Translation table11 
GC content56% 
IMG OID643566488 
ProductAAA-4 family protein 
Protein accessionYP_002462983 
Protein GI219848550 
COG category[R] General function prediction only 
COG ID[COG0613] Predicted metal-dependent phosphoesterases (PHP family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAG TAAGAGAAGA GCGCAACGGG ATGCGCTGGA TTCGGATTGA TTTGCACCTG 
CACACGCCTG CATCTGAGGA TTACGCCGAA CCAAACGTTT CTTACCTTGA CATTCTTCAA
GAAGCTGAGC GTCGCGGTCT TGAGATTATC GCCTTTACCG ACCACAATAC GGTTGCCGGC
TACGAGCAGT TTCAGCGCGA GATTGAGTTT CTGACGACCC TGGAAAAGGC CGGACGGTTG
ACCGATGATG AAGAAGCTCG TTTGGCTGAG TATCGTCGGT TGCTCGATAA GATCACCGTC
TTGCCCGGCT TTGAGTTTAC GTCGCACTTC GGTGCCCATA TCCTCGGTAT CTTTCCGCCG
AACCGCCCGC TTAGCTTGAT CGAGGCTACG TTGTTGCAGC TCGGTATTCC CGCCGAAGTT
CTGAAGGGTG GGGTGTGTAG TGTCGCTAAT ACCCGGCACG TGACCGAGGC ATACGAGATT
ATTCATCGCG CTGGCGGTCT TGTGATCGCG GCGCATGCCA ACGGGCCAAA CGGAGTGATT
ACCGAAACCC TCCGTATGGG GACAAGTGGC CAGGCTCGTG TGGCGGTGAC CCAAAGCCCC
TATCTCCACG CACTGGAGTT TATCAATTTC TATACCGATC ACGAGAAGTT TACCTCACCC
GGTTTTTACA ACGGTAAGAC CGAGCATTAC GAGCGGCGGA TGTTCTGTAT TCAGGGTAGC
GATGCACATC GGCTGCGTCG CTCTGCTGAG TCTGATGCCC AAGCGACCCA CCGCCACGGC
ATTGGTGACC GCTATTTTGA GGCGCTGTTG CCCGACCGTA GTTTTGAAGC CCTGAAGATG
CTCTTTACCG GTCAGGATTT TGATCGAGTG CGGGTGCCGA AGCGTGATCA GAAGCAGTGG
TCACTCGATG TCGTGCGCTT TAGTGGGAGT ACCGACCGCC AAATCTTGCG CGCTGTTCCC
GATCCGACAA CGGCAGCCGC GCTCTGGCCT GATGTGGCGG CGTTGGCAAA TATCGGTGGT
GGGGTGCTCG TGATCGGCTG TGAGCCTGGG GGTAAGGTGA TCGGTGTTGA ACGGCCCGAT
CAGCTTACCG AGTCGTTACG GCAAAGTGTG CAAGAGCATA TTACGCCGCT GCCCTACTTG
TCGTTTGAGT TGATGCACTA CGAAGGGCAA GACGTGATCC GCGTTGAGGT CAAAGCGCAG
GATCCGCCAC CTTACGTAGG GAGTAACGGT ACGATCTACA TCCGGCGCGA TAACAAGACG
TTTCCTGCGA ACCGCAGTGA AATTATCCAA TTGTGTCGCC AGGCGATTGC ATCCGGTGAA
CCTTCATCAC TCGATAACGG CGAGACGTTG GAACTACCGC GTTCGGGCGT CGAGATTGTC
AGTAGCCAGC GTCGTGGTGG TACGTGGGTG TATGAAGTGC GTGATCTGCG TACTACCGCC
GGTGTGACCC GTGATCGTGC CCAAGGGTTG TGGGCTTACG CAATCGATCG CCACGAAGAT
TTGCGTGATG GTCGGCTCGA TCTCCAGAGT CAAGTGCGTT GGCGCGGCCG GCTCGGTCTC
TGGCGGGCGT ATCGCCAAGG TTCGCGTGTG AAGTACGATC TGGTCCATCG TGACCCGAAT
GGTGTGATTG ACCATATTTT CTACGGTGTG AGCGATTGGG GTCTCGGTGA AGCGTGGATG
AGCCTTTTGA ATGAAGCCGG TGCGCGCATT GAGACGGAGA CAGCCGATTT CGATCAGGAA
GATGAGATGG AGGTGCCGCC GCCACCAGAT ATTGAACCAT GGGGGGAACG GCGGATCCGC
TGGCGTGGTC GTGGTGGTTT AGTACGCATT TTCCTCGGTG ACGATGGGCA ACCGCGGTTT
GATCTGGTGA TGAAGGATAA AGAAACCGGT GTCGTGCAAG AATACAACAA TGTGCCGCGC
GAGAAGCTTT CCGAGGCATG GCTGGCGTTG ATCCGCGTTG CCCGTCCGCG TACCGGTATC
GAGGTGGTAA GTGCTAGTCG TAGCGAAGAT GGCGATTGGC TCTACGTCTT CCGTAATCTG
CGTACCGGCG AGATTAGTAG TGCGCCATGG CGGTTGCAAG ATATCGAACC CGGTACGGTG
CGTGAGTATG CGGCGCGTAT GTACCACCAA GATATTCCGC TCGATCAAGC GAAGGTGCGC
TGGTGGGGAA ATATTGGCTA TTTGCGCCCA ATGCGATCGC AGGTCGATTT GGTCTATGTT
GATGAGTACG GCATGACTCA CATCTACTAC GCTGCCCGGC GTGATGAATT GACCGGTGAG
TGGCGAGAGT TGCTCCAACT GTATGGCGAG TAG
 
Protein sequence
MSTVREERNG MRWIRIDLHL HTPASEDYAE PNVSYLDILQ EAERRGLEII AFTDHNTVAG 
YEQFQREIEF LTTLEKAGRL TDDEEARLAE YRRLLDKITV LPGFEFTSHF GAHILGIFPP
NRPLSLIEAT LLQLGIPAEV LKGGVCSVAN TRHVTEAYEI IHRAGGLVIA AHANGPNGVI
TETLRMGTSG QARVAVTQSP YLHALEFINF YTDHEKFTSP GFYNGKTEHY ERRMFCIQGS
DAHRLRRSAE SDAQATHRHG IGDRYFEALL PDRSFEALKM LFTGQDFDRV RVPKRDQKQW
SLDVVRFSGS TDRQILRAVP DPTTAAALWP DVAALANIGG GVLVIGCEPG GKVIGVERPD
QLTESLRQSV QEHITPLPYL SFELMHYEGQ DVIRVEVKAQ DPPPYVGSNG TIYIRRDNKT
FPANRSEIIQ LCRQAIASGE PSSLDNGETL ELPRSGVEIV SSQRRGGTWV YEVRDLRTTA
GVTRDRAQGL WAYAIDRHED LRDGRLDLQS QVRWRGRLGL WRAYRQGSRV KYDLVHRDPN
GVIDHIFYGV SDWGLGEAWM SLLNEAGARI ETETADFDQE DEMEVPPPPD IEPWGERRIR
WRGRGGLVRI FLGDDGQPRF DLVMKDKETG VVQEYNNVPR EKLSEAWLAL IRVARPRTGI
EVVSASRSED GDWLYVFRNL RTGEISSAPW RLQDIEPGTV REYAARMYHQ DIPLDQAKVR
WWGNIGYLRP MRSQVDLVYV DEYGMTHIYY AARRDELTGE WRELLQLYGE