Gene Cagg_0213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0213 
Symbol 
ID7269127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp263977 
End bp266898 
Gene Length2922 bp 
Protein Length973 aa 
Translation table11 
GC content57% 
IMG OID643565082 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_002461597 
Protein GI219847164 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCTCT ATCTCGTGAC AGTCGCGACG CGCACTGCCG TATCACAGAC ACTCTACCTA 
CTCGACGGAC CTGAACTTAC ACCGGTAGTA GTTTCGCAAC TGGTCGAACA ACTCCTTCAC
GATCCTGTCG TTCAGCAAGC TCACTGGCAA ATGCTCACAG ATGACCAACC CCCGACCGAT
CCTTTTTTAC CCACAACTGA TGCGCTCGTG GTGGAAGTTG CCTATCGTCC CGGTGTGACC
GATAGCGAAG GGGAAAGTGT GATCGAGGGA GCACGCCGAT TAGGTGTGCG TGGCTTGACG
CATGCGCGTG CGCTGCGACG TTATGTGTTG CCACCGGAGA CCGATCCGGT GCAGGCGGCG
GCCGAGTTGG CGATTGAGGT CGTGCATACT ACGATTGCGT ACCGTACCGG CCAAGCAACG
ACGGCACGAG CAGCATTCTA TGCGATGTTG GTCGCCGAAC CGGCCCCGAC CAGGCCGGTC
GTTGCCAATA TTCCCCTATG CACCGCCGAT GATGACGAAT TACTTAACAT AAGTCAGCAA
GGTGTTCTCG CGCTTGATTT GGCCGAAATG CGCGCCATTC AACGCTATTT TGCCGGACAA
GGTCGCGATC CGACCGACGG AGAACTCGAA ACCATTGCCC AAACCTGGAG CGAGCACTGT
TCACACAAGA CGTTCAAAGC GCGTGTTCGG TATGAACAAC CACCGGAGAC ATTACCGGTC
AACGCCGATT TACATCCGAT GCTTGCGCGA TTGGCAAGCG GTCAGATGGA GATTGATAGC
TTGATCCGCA CCTTTCTGAT GCGGGCAACC GAACAGGTGT TGGCCAAGCG TCATGATGAA
TGGGTACTTA GCGCGTTTGT TGATAATGCC GGAATTGTCG CATTCGGGCC AAACTATGAG
GTTTCGTATA AGGTCGAGAC GCACAACCAT CCGAGTGCGC TTGAGCCGTT TGGTGGAGCG
AATACCGGTG TAGGCGGGGT CATTCGTGAT GTATTGGGCG TATCGGCCCG CCCGATTGCC
AATATCGACG TGCTCTGTTT CGGCATGCCC GATGCACCGC CGCCGCCACC TGGTGTCCTC
CATCCACGTC GGGTGGCAAG CGGTGTAGTT GCGGGTGTGC GCGACTATGG CAACAAACTC
GGTATTCCAA CGGTGGGTGG AGCGGTATTG TTCGATCCCG GATATACTGC CAACCCGCTC
GTCTATTGCG GGACCGTTGG GATAGCACCG CGCGGCCTAC ACCCACGTAA TGTACGTCCC
GGCGATATTA TTGTGGTGAT GGGAGGACGG ACGGGGCGCG ACGGTATTCA TGGTGCGACA
TTTTCGAGTA TCGAATTGAC GCATACCACT GCGGTTGAGG TGGGCAGTGC GGTGCAGATC
GGCGATCCGA TCACTGAAAA GAAGATGCTC GATGTTCTGT TGCAGGCACG CGATGCCAGG
CTGTACTCGG CGCTTACCGA TTGTGGCGCC GGGGGATTAT CTTCGGCAAT CGGTGAGATG
GGCGCCGAAT TGGGTGCCGA GGTTCACCTT GAGCGTGTGC CGTGCAAATA TGCCGGCTTG
CAACCATGGG AGATTTGGCT TTCGGAGGCG CAAGAGCGAA TGGTGCTCGC CGTACCACCC
GATCGGCTGG TAGCACTATT AACGCTTTGT GTCGCCGAGG ATGTTGAGGC AACGCCGATT
GGCCGCTTTA CCAACGATGG CCGCCTCCGC GTCTATTACC ATGATCTCGC CGTGGTTGAT
CTTGAGATGG CGTTTCTCCA CGAGGGCCGA CCCCAACGAA TGCTCGAAGC GCGTTGGGAG
CCATCACCGG CATTGACCCG TTCGCCACAG CTCGATCACG TCCAGCCGGA AGCTGCGTTG
CTGGCGTTGC TGGCGCACCC ATCGATTGCC TCGAAAGAGC GTATTATTCG CACCTACGAT
CACGAAGTCG GTGGTGGCAC GGTGATCAAG CCGCTGGTGG GTGCGGCATT AGCCGGCCCC
TCTGATGGAG CCGTCCTCCA GCCATTACCC GACAATCCGG CCGGCCTGGC ATTAGGGTTC
GGTATCTGTC CGCACTACGG TCAGCACGAC CCATACTGGA TGGCACTAGC CGCTATCGAT
GAGGCGCTAC GGAATGTCGT GGCAGTCGGT GGTGATCCCG ACCAAACGGC GATTCTTGAC
AATTTCTGCT GGGGTGATCC TAAACAGCCG GATCGCATGG CCGGGTTAGT ACGGGCAGCG
GCGGCATGTT ACGATGGCGC AGTCGCGTTT GGTACACCCT TTATTAGTGG GAAGGACTCG
CTCAATAACG AGTATCGCGA TGCCGATGGC CGCCGGATAG CGATCCCGCC CACGTTGCTG
ATTTCGGCAA TGGCTTACGT CCATGATGTC AGGCAATGTG TGACGATGGA TCTTAAACAG
GCCGGCGATG TCATCTACTT GCTGGGGGCG ACGCGGATCG AATTTGCCGG TAGTCATCTG
GCCGCAGTAG GGATGATCGA AGATGGTGGT GCGCTGCCAC AGGTCGATCT GGCAACCGCA
CGTGCAACGT TTCGTGCCTT ACACCGCGCT ATCCGTGCCG GCTTGGTGCG CGCCTGTCAT
GATTTGAGCG AGGGTGGGCT GGCGGTAGCG GCGGCTGAGA TGGCGATTGC CGGTGAGCTT
GGGTTGCAGC TCAACCTCGA TACAATCGAT CTCGATCCTA TCGCGGCCTT GTTCAGTGAG
TCACCAAGCC GGTTCTTGCT AGAAGTCGAT CCTGCACAAA CGGCAGCACT GGAAGCAGTG
TTAGATGGTT TGCCACTGGT TCGCTTGGGG GTAGTCACGA CAACGCCGGT TGTACAGATC
ACGCAGCACA ATCGGCAGTT GATTGAGCTG CCGGTGACCA GGTTGCGCGA GGTATGGCAA
CATGGCCTCG ATGCAATTAG TGTAAAGGAC GAAGAACTAT GA
 
Protein sequence
MPLYLVTVAT RTAVSQTLYL LDGPELTPVV VSQLVEQLLH DPVVQQAHWQ MLTDDQPPTD 
PFLPTTDALV VEVAYRPGVT DSEGESVIEG ARRLGVRGLT HARALRRYVL PPETDPVQAA
AELAIEVVHT TIAYRTGQAT TARAAFYAML VAEPAPTRPV VANIPLCTAD DDELLNISQQ
GVLALDLAEM RAIQRYFAGQ GRDPTDGELE TIAQTWSEHC SHKTFKARVR YEQPPETLPV
NADLHPMLAR LASGQMEIDS LIRTFLMRAT EQVLAKRHDE WVLSAFVDNA GIVAFGPNYE
VSYKVETHNH PSALEPFGGA NTGVGGVIRD VLGVSARPIA NIDVLCFGMP DAPPPPPGVL
HPRRVASGVV AGVRDYGNKL GIPTVGGAVL FDPGYTANPL VYCGTVGIAP RGLHPRNVRP
GDIIVVMGGR TGRDGIHGAT FSSIELTHTT AVEVGSAVQI GDPITEKKML DVLLQARDAR
LYSALTDCGA GGLSSAIGEM GAELGAEVHL ERVPCKYAGL QPWEIWLSEA QERMVLAVPP
DRLVALLTLC VAEDVEATPI GRFTNDGRLR VYYHDLAVVD LEMAFLHEGR PQRMLEARWE
PSPALTRSPQ LDHVQPEAAL LALLAHPSIA SKERIIRTYD HEVGGGTVIK PLVGAALAGP
SDGAVLQPLP DNPAGLALGF GICPHYGQHD PYWMALAAID EALRNVVAVG GDPDQTAILD
NFCWGDPKQP DRMAGLVRAA AACYDGAVAF GTPFISGKDS LNNEYRDADG RRIAIPPTLL
ISAMAYVHDV RQCVTMDLKQ AGDVIYLLGA TRIEFAGSHL AAVGMIEDGG ALPQVDLATA
RATFRALHRA IRAGLVRACH DLSEGGLAVA AAEMAIAGEL GLQLNLDTID LDPIAALFSE
SPSRFLLEVD PAQTAALEAV LDGLPLVRLG VVTTTPVVQI TQHNRQLIEL PVTRLREVWQ
HGLDAISVKD EEL