Gene Cagg_2184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2184 
Symbol 
ID7266757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2677637 
End bp2678662 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content60% 
IMG OID643567015 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_002463503 
Protein GI219849070 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.711574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATATA GTGCCGCCGG AGTTGATATT GCTGCCGCCA CCCGGGCCAA AGAACTGATG 
ACCACCGCCG TGCGCAGCAC CCATGGGCCG GCCGTCTTGG CCGGAATGGG AGCGTTCGGT
GGTTGTTTTG ATGCCGCACT CGCTCTAGCC GGTATGCAGG CCCCGGTGCT CGTGAGCAGC
ACCGATGGTG TCGGGACCAA AACTTTAGTC GCCGCTGCTT TAGAACGCTA CGATACGGTT
GGGCAGGATT TAGTCAACCA TGCCGTAAAC GATATTTTGG TGCAGGGTGC GCGACCGCTC
TTCTTTCTTG ATTACATTGC CGTCGCCAAA CTCGATCCCA TCCAGATTGC CGCTATCGTC
AGCGGTGTGG CAGCAGGATG TCGCGCCGTC AGTTGTGCGC TGATCGGGGG CGAAACGGCT
GAAATGCCCG ATATCTACGC CCCCGGTGCC TTCGATCTGG CCGGCACAAT CGTCGGTGTG
GTCGAACGGG CCGATCTCTT GCCGCGTCCT GATGTGACCG CCGGCGATGC GATCTTGGCC
CTCCCTAGCA CCGGTCTACA TACCAATGGC TACTCGCTGG CCCGTCGGAT CGTCGCTCAA
CACTTCGCCA CCGAAGGCTA CCACGCGCGT CCGTCATTGC TCGGCGGACA AACCATCGGC
GAGGCGCTAC TGGCTATTCA CCGTTGTTAT CTCGCTGAAG TGAACGCACT GCGCGCAGTT
GTCCCGGTGA AAGCCCTCTG CCACATCACC GGTGGCGGGA TTTATGACAA TCTCCCCCGC
GTGCTTCCCA AGGGGATGGG CGCAGAACTC GTGCGCGGCA GTTGGACCAT TCCCCCGATT
TGTCAGTTGC TGGTAGAAGT CGGTGGCCTC GCCGAGAGTG AGGCTTATCA CACGCTCAAT
ATGGGGCTGG GCATGCTGGT GATCGTCCCC ACCGAGTCGG TTGCCACCGC CCAAAAGGCC
GTTGCCGAAG CACAACTGGT CGGTAGAGTA ACGGCCACAC CGACAGTGCG CTTGGTCGAC
GGATAA
 
Protein sequence
MKYSAAGVDI AAATRAKELM TTAVRSTHGP AVLAGMGAFG GCFDAALALA GMQAPVLVSS 
TDGVGTKTLV AAALERYDTV GQDLVNHAVN DILVQGARPL FFLDYIAVAK LDPIQIAAIV
SGVAAGCRAV SCALIGGETA EMPDIYAPGA FDLAGTIVGV VERADLLPRP DVTAGDAILA
LPSTGLHTNG YSLARRIVAQ HFATEGYHAR PSLLGGQTIG EALLAIHRCY LAEVNALRAV
VPVKALCHIT GGGIYDNLPR VLPKGMGAEL VRGSWTIPPI CQLLVEVGGL AESEAYHTLN
MGLGMLVIVP TESVATAQKA VAEAQLVGRV TATPTVRLVD G