Gene Cagg_3077 details

Gene Information

Locus tag: Cagg_3077
Symbol:
ID: 7269494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3739275
End bp: 3740408
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 60%
IMG OID: 643567897
Product: phosphoribosylaminoimidazole carboxylase, ATPase subunit
Protein accession: YP_002464371
Protein GI: 219849938
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID: [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein
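
The coordinate and length fields in this record agree with one another, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed 1134 bp) and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3739275, 3740408)
print(length)                     # 1134
print(protein_length_aa(length))  # 377
```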


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.276953
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000000285564
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGCGGGTAG GAATCTTTGG CGGTGGACAA TTAGCCCAGA TGCTTGTGCA AGCAGCGATA 
AGTCTGGGGG TTGAGATTGT TATCTGTGAG CGGGTGCCGG ACAGCCCGGC GGCACGGTAT
ACGCAGCATA CGGTGGTCGG GCCCTGGGAA GACGAAGCGG TGCTGGCTGC CTTTGCTCGT
CGGTGTGATG TCGTCACGCT CGAGAATGAG TTTATCGATG CCCACTTGCT CAGGCGGGTT
GAGGAGTTGG GCACACCGGT GTGGCCGTCG CCGGCAACCG TCGCTGTAGT CCAGGATAAG
TTGTGGCAGA AGGAGCGCCT GGTAGTGGCA GGGCTTGCGG TGCCCCCGTT TCGGGCAGTA
GACGGACCTG ATGATGTTCT TGCGGCTGCG CAGGCGTTTG GCTGGCCGTT GGTCCTCAAG
ACGCGCCGCA ACGGTTACGA TGGCTACGGC AATGCCACGT TGCGCGGTCC GGCCGATGTC
GTTCCGGCGT GGGAACGGTT GACTCGCGGC GGTAGTCCAT TGCTGGTTGA GGCGTGGGTG
CCGTTTACCC GTGAATTGGC GGTTATGGTG GCCCGTCGCC GTGATGGTAC AACGGCCGTC
TACCCGGTAG TTGAGACGGT GCAGCAAAAC CATATTTGCC ACGTTGTCTA TGCCCCGGCA
GCTATCGCGC CGGATACCGC AATTACTGCG ACTGAGTTGG CAGTCGCTGC CATACAGGCA
GTTGACGGTG TGGGCATTTT TGGGGTCGAA TTATTTGCCC TCGCCGACGG CCATGTGCTG
ATTAATGAGT TGGCCCCGCG GCCTCACAAT TCCGGCCACT ATACGATTGA AGCCTGCGCC
ACCTCGCAAT TTGAACAGCA TTTGCGCGCT GTGCTTGGCT GGCCGCTGGG TGCGACAACC
ATGCGCGCGC CGGCAGCAGT GATGGTTAAC CTACTTGGGC AGCGGAATGC TCCGATCAAT
ACCGATGCTA TCGTTGCAGC ACTGGATGTA CCCGGTGCGC ATATGCACTT TTACGGCAAG
CGCGACGAAC GAGTTGGTCG CAAAATGGGC CATGTTACCG CCCTAGGCGC GACATTACGT
GACGCAGAGG CGATTGCCCG CCGTGCTGCC GGATTGGTGG CAATGCCGGT GTGA
 
Protein sequence
MRVGIFGGGQ LAQMLVQAAI SLGVEIVICE RVPDSPAARY TQHTVVGPWE DEAVLAAFAR 
RCDVVTLENE FIDAHLLRRV EELGTPVWPS PATVAVVQDK LWQKERLVVA GLAVPPFRAV
DGPDDVLAAA QAFGWPLVLK TRRNGYDGYG NATLRGPADV VPAWERLTRG GSPLLVEAWV
PFTRELAVMV ARRRDGTTAV YPVVETVQQN HICHVVYAPA AIAPDTAITA TELAVAAIQA
VDGVGIFGVE LFALADGHVL INELAPRPHN SGHYTIEACA TSQFEQHLRA VLGWPLGATT
MRAPAAVMVN LLGQRNAPIN TDAIVAALDV PGAHMHFYGK RDERVGRKMG HVTALGATLR
DAEAIARRAA GLVAMPV
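
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The following is a minimal sketch of that step, together with a GC-fraction calculation and a stop-codon check. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the final three bases form a stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# Last line of the gene sequence as printed in this record.
last_line = "GACGCAGAGG CGATTGCCCG CCGTGCTGCC GGATTGGTGG CAATGCCGGT GTGA"
tail = clean_sequence(last_line)
print(len(tail))             # 54
print(ends_with_stop(tail))  # True
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` is one way to check the listed gene length and GC content against the sequence itself.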