Gene Cag_1229 details

Gene Information

Locus tag: Cag_1229
Symbol:
ID: 3748262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1631078
End bp: 1632223
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 50%
IMG OID: 637773762
Product: homocitrate synthase
Protein accession: YP_379533
Protein GI: 78189195
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR02660] homocitrate synthase NifV
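
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(1631078, 1632223)
print(length_bp)                  # 1146
print(protein_length(length_bp))  # 381
```

Both values match the Gene Length and Protein Length fields of the record.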


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCAAATT CTGCAATGGA GCTGCGCAAA TCGTGGATTA TTGATACCAC CTTGCGCGAT 
GGCGAGCAAG CCCCTGGTGT GGTGTTTAGT GCGGAGGAGA AGCGCGATAT TGCAGCGCAA
CTTGCGGCAG CAGGTGTTAG TGAATTGGAG GTTGGTTACC CCGCCATTAG TGGCGATGAG
TTAGAAACCA TCCGCTCAAT TGTTGCTATG CGTCTGCCTT TGCGTGTAAC AAGCTGGGCG
CGTGCAAAGT GGGATGATAT TGAGGCTGCT CGCCAAAGTG GCACCGAAGC GGTTCATATT
AGTTTTCCTG TATCGGCGCT TTATTTGCAA TTAATGGAGC GCTCTTACGA GTGGGTGCAG
GAGCAGTTAA GCGAATTAAT CGGCAAAGCC AAAGATTATT TCGAGTTTGT GAGTGTTGGG
GCGCAAGATG CCACCCGTGC GGATATTGAG CTGCTTTCGC GCTTTGTTTG TGATGCAAGC
GCGGCAGGCG CTCAGCGCAT TCGCCTTGCC GATACGGTGG GAATTGCCAC GCCAATTTCC
GTGATGCACC TTATTGGGGA ACTGCAACGA GTTACTTCAG TGGATCTTGA ATTTCATGCT
CATAACGATC TTGGTATGGC TACAGCAAAT GCGTTTACAG CCCTTGCTGT TGGTTGCCAA
GCCGTTAGTG TGTCGGTAAC GGGGCTTGGC GAACGGGCGG GCAACGCTGC GCTTGAAGAG
CTTGCAATTG CTTTGAAACT TTCGGGAGAG TTTGAAGCCA CCATAAAAAC TGAAATGTTG
TCGAGCTTAT GCGAAACGGT AAGCAAAGCG GCTGGTAGGG TGATTGATGA GCGCAAAGCC
GTGATTGGCA AAGCTGTTTT TCAGCACGAA TCGGGCATTC ACTGTGCCGC ATTGTTGAAG
CATCCGCTCT CTTATCAGCC ATTTTTACCC GAACAAATTG GCGGTAGAGA GCATGAATTG
GTGATTGGCA AGCATTCGGG AAGTGCGGCT ATTCAGCACT TTTTTGCCGA GCGAGGCATT
CCGCTGAGCC GCAGCGAGGC AACACAGTTG TTAGCAAAGG TTCGCCAAAT GGCGACTGAA
AAAAAAGGAT TGCTTACAGC TAAAGAACTT GAAGAGCTTT ATACAGAACT GTTTAATATT
CATTGA
 
Protein sequence
MPNSAMELRK SWIIDTTLRD GEQAPGVVFS AEEKRDIAAQ LAAAGVSELE VGYPAISGDE 
LETIRSIVAM RLPLRVTSWA RAKWDDIEAA RQSGTEAVHI SFPVSALYLQ LMERSYEWVQ
EQLSELIGKA KDYFEFVSVG AQDATRADIE LLSRFVCDAS AAGAQRIRLA DTVGIATPIS
VMHLIGELQR VTSVDLEFHA HNDLGMATAN AFTALAVGCQ AVSVSVTGLG ERAGNAALEE
LAIALKLSGE FEATIKTEML SSLCETVSKA AGRVIDERKA VIGKAVFQHE SGIHCAALLK
HPLSYQPFLP EQIGGREHEL VIGKHSGSAA IQHFFAERGI PLSRSEATQL LAKVRQMATE
KKGLLTAKEL EELYTELFNI H
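
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequence is read 5' to 3' in frame from the first base. It uses the amino acid assignments of translation table 11, which are the same as those of the standard code, and it does not handle the alternative start codons that table 11 permits, because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 6 bases of the gene sequence listed above.
print(translate("ATGCCAAATT CTGCAATGGA GCTGCGCAAA"))  # MPNSAMELRK
print(translate("CATTGA"))                            # H
```

The two outputs match the first ten residues and the final residue of the protein sequence. Passing the full gene sequence to `translate` should give the full protein under the same assumptions.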