Gene Cag_2024 details

Gene Information

Locus tag: Cag_2024
Symbol:
ID: 3747997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2563625
End bp: 2564620
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 47%
IMG OID: 637774561
Product: phosphoribosylaminoimidazole synthetase
Protein accession: YP_380315
Protein GI: 78189977
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase
TIGRFAM ID: [TIGR00878] phosphoribosylaminoimidazole synthetase
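
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the coordinates are 1-based and inclusive (which is consistent with the reported 996 bp) and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record; helper names are illustrative only.
START_BP = 2563625
END_BP = 2564620
GENE_LENGTH_BP = 996
PROTEIN_LENGTH_AA = 331


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the values in this record, 2564620 - 2563625 + 1 = 996 bp, and 996 / 3 - 1 = 331 aa.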


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGATGG ATTATAAAAA AGCGGGCGTT GATATTAGCG CAGGTGAAGA GTTTGTGCGC 
ATGATAAAAC CTCAAGTTCG CCAAACCTTT ACCCCAAACG TTATTACCGA TATTGGAGCG
TTTGGCGGCT TTTTTATGCC CGATTTTTCT CGTTACCGTA AGCCTGTGTT GGTAAGCAGC
ATTGATGGTG TTGGCACAAA GCTCAAAATT GCCATTGAGC TTGACCGTTA CAACACCGTT
GGTTCCTGCC TTGTTAACCA TTGCGTAAAC GATATTTTAG TGTGTGGCGC ACGTCCACTT
TTCTTTCTTG ACTATTACGC TTGCGGTAAA CTAACGCCTG CGATTGCCGC TTCGGTGGTA
ACAGGTATGG TTGCCGCTTG TCGTGAAAAT GGCTGTGCGT TAATTGGTGG CGAAACGGCT
GAAATGCCCG GCATGTACAA TGCTGAAGAT TTTGATCTTG CTGGCTCTAT TGTAGGAATG
GTTGACCATG AGCGCATTAT TAACGGCTCA AAAATGCAAG CGGGCGACAT CATGCTTGGC
TTAGCCTCAA ATGGGCTGCA CACCAACGGC TACTCGCTTG CTCGCAAAGT GCTTGCAGGG
CGAATGCACG AAACCATTTC GGAAGCGAAC GAAACCATTG GCGAAGCTCT TTTAAAGGTA
CATCGCACCT ATTTACCTAT TATTGAACCA TTACTTGAAT CCCCCGATAT TCATGGGTTA
TCGCACATTA CGGGTGGCGG CTTAATGGGC AACACTATGC GCATTGTGCC TGAGGGCTTA
AAGCTTGAGG TTGATTGGCA AAGTTGGCAG GAACCTCTTA TTTTTGATAT TATTCGCCGA
GAGGGCAACG TGCCCGAAGA GGATATGCGC CGCACCTTTA ACCTTGGAAT TGGTTTAGTG
ATGATTGTTG CAGCAGAAAG CGTTGAGCGC ATTCTTGCCA ACTTGCAATC ACGCGGCGAA
AATGGCTACA TTATTGGGCA GGTAGCCAAA AGCTAA
 
Protein sequence
MQMDYKKAGV DISAGEEFVR MIKPQVRQTF TPNVITDIGA FGGFFMPDFS RYRKPVLVSS 
IDGVGTKLKI AIELDRYNTV GSCLVNHCVN DILVCGARPL FFLDYYACGK LTPAIAASVV
TGMVAACREN GCALIGGETA EMPGMYNAED FDLAGSIVGM VDHERIINGS KMQAGDIMLG
LASNGLHTNG YSLARKVLAG RMHETISEAN ETIGEALLKV HRTYLPIIEP LLESPDIHGL
SHITGGGLMG NTMRIVPEGL KLEVDWQSWQ EPLIFDIIRR EGNVPEEDMR RTFNLGIGLV
MIVAAESVER ILANLQSRGE NGYIIGQVAK S
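
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further processing. The sketch below is one possible way to do that, using only the Python standard library. It also shows how a GC percentage and a codon list could be derived from the cleaned string. The function names are hypothetical, and the GC calculation here is a plain count of G and C over total length, which may differ from how the database computes its reported value.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace; return uppercase."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (simple count)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def codons(dna: str) -> list[str]:
    """Split a DNA string into complete triplets, dropping any trailing remainder."""
    seq = clean_sequence(dna)
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = (
        "ATGCAGATGG ATTATAAAAA AGCGGGCGTT GATATTAGCG CAGGTGAAGA GTTTGTGCGC"
    )
    print(len(clean_sequence(first_line)))
    print(codons(first_line)[:4])
    print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence to `clean_sequence` should give a string whose length matches the Gene Length field, and the first codons (ATG, CAG, ATG, GAT) line up with the first residues of the protein sequence (M, Q, M, D).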