Gene Cag_0171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0171 
Symbol 
ID3747697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp191587 
End bp192873 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content49% 
IMG OID637772698 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_378492 
Protein GI78188154 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTT TAATTGTAGG AAGTGGTGCG CGTGAACATG CTTTGGCGTG GGCAGTTGCG 
CAAAGCGCCG AAGTAGCGCA GGTGTTTGTA GCACCCGGTA ATGGTGGCAC TGCCCAAATG
GGTGGCAAGG TGTGCAATTG CTCAATAAAA GCAACAGCGA TTAATGAGTT GCTGGCTTTT
GTGCAGCAAG AGTCCATTGG CTTAACGGTG GTTGGTTCCG AGCAACCGTT GGAGCTTGGC
ATTGTTGACC AATTCCGTAG CGCTGGTTTA GCAATTGTTG GACCAACGCA ATATGCCGCT
CAGCTTGAAA GCAGCAAAGT TTTTGCAAAA GCTTTTATGC AGCGCCACGC CATTCCAACG
GCAGGCTATC AACGCTTTTG CGATGTTGCA TCCGCACAAA CCTATCTTCA GCAGCCTCAA
TTACCATTCC CCCAAGTTAT TAAAGCAAGC GGACTTTGCG CCGGTAAAGG GGTGATTGTT
GCTATGAATA AAGCCGAAGC GCTTGCGGCG GTTAGCGATA TGCTGGAGGA TCGCATTTTT
GGCGATGCTG CCAATGAGGT GGTTATTGAA GCTTTTTTGC AGGGCGAAGA GGCAAGTGTG
TTTGCCTTAA CCGATGGCGT TTCCTACAAA CTTTTTTTAC CCGCTCAAGA TCATAAGCGA
GTGGGCAATG GCGATACGGG CAAAAATACG GGTGGCATGG GAGCTTATGC GCCCGCACCA
ATTGTTACGC CTGAGGTGAT GCAGAAAGTT GAAGAGCGCA TTATTCGCCC AACCTTGCAA
GGCATGGCGG CGGAAGGCTC TCCTTACACG GGCTTTTTAT ATGTTGGATT GATGATTGAT
AAGGGTGAAC CTTCGGTGGT GGAGTTTAAT GCGCGCTTGG GCGATCCTGA AACGCAAGTG
GTGCTTCCTC TCTTAAAGAG CGACTTTTTT GCTGCGTTGC GTGCTTCGGT GGATGGAACA
TTGGAATCGG CACCATTTGA AATGTATGCT AAATCAGCCA CAACAGTAGT AGTGGCATCA
CAAGGTTACC CCGATAGCTA TACAACAGGC AAGCCAATTA CCATTGCTCC CGAAGCCGCC
ACAATGGAAG AGGCTATAAT TTTTCATGCA GGCACAGCGC TTCAAGGCAA CTCACTTGTT
ACGGCAGGTG GGCGCGTTTT TTCAGCAACA GCGCTGGGCA ATTCATTACA AGAAAGCATA
ACGCGCTCAT ACGCATTAGT GCAGCACATT ACCTTTGAAG GTGCATTTTA CCGCAACGAT
ATTGGAGTAA AAGGATTGCG AGCATGA
 
Protein sequence
MKVLIVGSGA REHALAWAVA QSAEVAQVFV APGNGGTAQM GGKVCNCSIK ATAINELLAF 
VQQESIGLTV VGSEQPLELG IVDQFRSAGL AIVGPTQYAA QLESSKVFAK AFMQRHAIPT
AGYQRFCDVA SAQTYLQQPQ LPFPQVIKAS GLCAGKGVIV AMNKAEALAA VSDMLEDRIF
GDAANEVVIE AFLQGEEASV FALTDGVSYK LFLPAQDHKR VGNGDTGKNT GGMGAYAPAP
IVTPEVMQKV EERIIRPTLQ GMAAEGSPYT GFLYVGLMID KGEPSVVEFN ARLGDPETQV
VLPLLKSDFF AALRASVDGT LESAPFEMYA KSATTVVVAS QGYPDSYTTG KPITIAPEAA
TMEEAIIFHA GTALQGNSLV TAGGRVFSAT ALGNSLQESI TRSYALVQHI TFEGAFYRND
IGVKGLRA