Gene Cag_0458 details


Gene Information

Locus tag: Cag_0458
Symbol:
ID: 3747383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 536332
End bp: 537618
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 49%
IMG OID: 637772991
Product: folylpolyglutamate synthetase
Protein accession: YP_378774
Protein GI: 78188436
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0285] Folylpolyglutamate synthase
TIGRFAM ID: [TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase
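
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 536332
end_bp = 537618
gene_length_bp = 1287
protein_length_aa = 428

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)  # 1287 0 428
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the derived protein length matches the listed 428 aa.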


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCTATC AAGAAGCTCT CAATTTTCTT TATCCTCTCC ATCGTTTTGG GATTAAGCCC 
GGGCTTGAGC GTGTGCAGGC GCTTTTGCAA ACACACGGTA ATCCCCATAA GCGGCTTGGT
AGGGTAGTGC ATCTTGCAGG CACAAATGGT AAGGGTACCA CGGCGGCAGC GTTGGCGGCA
ATGTTTCAAG CAAGTGGTTA TAAAACGGCG CTTTACACCT CACCGCATCT TGTAGATTTT
ACCGAGCGCA TTCGCATTAA TGGGCAGCCG ATATCGCAAG AGTTTGTTGC GCACTATTGT
GCCATCATGC AGTCAACCAT TCAAGCCACA AACGCCACCT TTTTTGAAGC CACGACTGCT
TTGGCATTTT CATGGTTTGC CGATGAAGCG GTTGAGGTTG CGGTGATTGA AACGGGACTT
GGTGGTCGCT TGGATGCTAC CAATGTGGTA GAGCCTGAAT TTGTTGTTAT TCCAACCATT
GGGCGCGATC ACGTTGAATG GCTTGGTGTA ACCTTACCAG CCATTGCGGC AGAAAAAGCG
GCTATTATTA AGCAAGGGTG CTCTGTTTTT ACAGCGGCTA CTCAGCCTGA AGTGCTTGCG
GTGATTGAGC AACAAGCGCA GGCGTGCAAT GCAGCGTTGT TTCTTGCTGG GCGAGATGTG
CATTATGAGG TTGTTGCTTC TGAACCCGGT TTGCTCGGCT TGCATGTTCA AACAGCAACG
CAGCGTTATG CAGAGCTTTA TCTCCCTCTT ACGGGTACCT TTCATGCGGC AACTATTGCG
CTTTCTGTGC AAGTTGCTGA ATGTGCGGGA TTGTCGGCTC ATATTATAAA GCATGGTCTT
CAGCAGCTTT TGCAAACGGG TTATCGTGCT CGGCTTGAAT TTGTTAATAA CGCACCAGCA
ATTTTTCTTG ATGTGTCGCA CAACCCTGAT GGTATGAAAG CAACCGTTGA TGCGCTGCTT
ACCTATCGTG AGCGTTACAA GCGCACCTTT GTGTTGCTGG GGCTTGCTTC CGATAAGGAT
GCGCTTGCCG TTATTCGTGA ACTGCAACGT CTTAATCCGT TGCTTGTTGC GGTGAACATT
CCCTCAGAGC GAAGTGTGGC TGCCGAACAG CTTGGCGCCT TATGCCAGCA AGAGGGTATT
GAGTTCATCA TACAAGGTGA TAGCGTGGCA GGATTGCGAT TTATTGAGCA GCAAGCAGGG
GAGCGTGATA TGGTGCTTAT TACAGGTTCA TTTTTTTTAG CAGGAGAACT GCTTGCTCAT
GGATTTTTTA AGGCAGTAAT GCAATAG
 
Protein sequence
MTYQEALNFL YPLHRFGIKP GLERVQALLQ THGNPHKRLG RVVHLAGTNG KGTTAAALAA 
MFQASGYKTA LYTSPHLVDF TERIRINGQP ISQEFVAHYC AIMQSTIQAT NATFFEATTA
LAFSWFADEA VEVAVIETGL GGRLDATNVV EPEFVVIPTI GRDHVEWLGV TLPAIAAEKA
AIIKQGCSVF TAATQPEVLA VIEQQAQACN AALFLAGRDV HYEVVASEPG LLGLHVQTAT
QRYAELYLPL TGTFHAATIA LSVQVAECAG LSAHIIKHGL QQLLQTGYRA RLEFVNNAPA
IFLDVSHNPD GMKATVDALL TYRERYKRTF VLLGLASDKD ALAVIRELQR LNPLLVAVNI
PSERSVAAEQ LGALCQQEGI EFIIQGDSVA GLRFIEQQAG ERDMVLITGS FFLAGELLAH
GFFKAVMQ
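
The sequences above are printed in blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step, not part of the original record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the start codon set lists only three codons commonly used in bacteria (translation table 11 permits others as well). Only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Common bacterial start codons; translation table 11 permits others as well.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}

# First line of the gene sequence, copied from the listing above.
first_line = "GTGACCTATC AAGAAGCTCT CAATTTTCTT TATCCTCTCC ATCGTTTTGG GATTAAGCCC"
seq = clean_sequence(first_line)

print(len(seq))                        # 60 bases in six blocks of ten
print(seq[:3] in COMMON_START_CODONS)  # True: the gene starts with GTG
```

Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field. The GTG start codon is consistent with the protein sequence beginning with M, since an initiator GTG is translated as methionine.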