Gene Cagg_3099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3099 
Symbol 
ID7269516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3760420 
End bp3761736 
Gene Length1317 bp 
Protein Length438 aa 
Translation table11 
GC content59% 
IMG OID643567919 
ProductFolC bifunctional protein 
Protein accessionYP_002464393 
Protein GI219849960 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCA CGACCTACCA GGCGGCACTA GACTACATCT ACAGTTTCAT CGATCCCACC 
CGCCAAGGAT CGCCCGATCC CGCGATTGCC CAACGCGGAC TAGATCGAAT TACCGCATTG
TTACGCGATG CCGGCAATCC ACACCAACAA TTACGTGCCG TTGTGGTAGC CGGTACCAAA
GGCAAAGGCA GCACCTGCGC GATGATCGAA GCAATGGCCC GCGCCGCCGG GCTGAAGGTG
GGGTTGTGGA CATCACCACA CCTTAGCTCG TACCGCGAAC GGATCCAGAT TGACCGTGAG
CCGATTAGTC AGCAGACGCT GGTCGAGTTA GTCAATGCAG TACTGCCGGT GGTAGAGGGG
TTTGACGGAG CGACCTATGG CCGGCCTAGC ACGTTCGATA TTGGTTTTGT GATGGCGATG
CGCCATTTTG TCGCCGAACA GGTCGATTTG GCCGTCGTCG AGGTTGGTTT GGGGGGAAGG
TATGACGCGG CGGCGACGAT CACACCGCTG GTCGCAGTGA TTTCCTCGAT CAGCTATGAC
CACATGGCAA TACTAGGACC AACGTTAGAC AAGATAGCTT TTAACAAAGC CGGAATTATC
CGTTCTGGGC AACCGGCGAT TAGTGTCCCC CAACAGGCCG ATGCCGCCGA AGTGATTGCG
GCTGAAGCGC AGATGGTTGG CGCACCACTC TGGCTTGCGG CCGAACCAGC AGTTGAGCCA
TGGGTTGGGA CGACAACACG ACTGGCTTAT CCCGCCCCGC CGCAGCCGGG CAAGTTGCAC
GGCACGTTCC AACGAGAAAA TGCGCGGTTG GCGATGGGAG CAGCGTTGCT GTTGCGCGGG
CAGGGGATTG CGATTGACGA TGCCGCGATC CGGCGCGGGT TAGCCGAAGC GTGGTGGCCG
GGCCGGTTTG AGGTGATCGA TGGTCGACCG CGTATCCTCA TCGATGGTGC CCACAATGGT
GATTCGGCAG TAAAGTTGTG GCAGGCTATC GAGCAAGAAT TACCACATCG CCGGTTTATT
CTCGTGCTCG GTACATCCCG CGATAAAGAT ATTGCTGCCA TCGCTGCCGC ACTCGCTCCA
CACGCCGATC ACATCATCAT TACCCGTTCG AGCCATCCCA AAGCGATGGA CCTCGACCGC
ATCGCGGCTG AGGTTGAACC GTTCGCGAGT GCGCCCATGA CCATCGTGCC TGTCGTGGCC
GAAGCGATTG CGACAGCACG CACCTTAGCC GGACCCGCTG ACCTGATCTG CGTTACCGGT
TCGCTGTTTG TAGCCGGTGC AGCACGCGAG GCGTTGGGGT TGGCGGTGGC AGATTAG
 
Protein sequence
MSITTYQAAL DYIYSFIDPT RQGSPDPAIA QRGLDRITAL LRDAGNPHQQ LRAVVVAGTK 
GKGSTCAMIE AMARAAGLKV GLWTSPHLSS YRERIQIDRE PISQQTLVEL VNAVLPVVEG
FDGATYGRPS TFDIGFVMAM RHFVAEQVDL AVVEVGLGGR YDAAATITPL VAVISSISYD
HMAILGPTLD KIAFNKAGII RSGQPAISVP QQADAAEVIA AEAQMVGAPL WLAAEPAVEP
WVGTTTRLAY PAPPQPGKLH GTFQRENARL AMGAALLLRG QGIAIDDAAI RRGLAEAWWP
GRFEVIDGRP RILIDGAHNG DSAVKLWQAI EQELPHRRFI LVLGTSRDKD IAAIAAALAP
HADHIIITRS SHPKAMDLDR IAAEVEPFAS APMTIVPVVA EAIATARTLA GPADLICVTG
SLFVAGAARE ALGLAVAD