Gene Cagg_1254 details


Gene Information

Locus tag: Cagg_1254
Symbol:
ID: 7266240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 1539704
End bp: 1540672
Gene Length: 969 bp
Protein Length: 322 aa
Translation table: 11
GC content: 56%
IMG OID: 643566096
Product: folate-binding protein YgfZ
Protein accession: YP_002462598
Protein GI: 219848165
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0404] Glycine cleavage system T protein (aminomethyltransferase)
TIGRFAM ID: [TIGR03317] folate-binding protein YgfZ
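
The coordinates and lengths listed above agree with each other, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the reported gene length includes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1539704
END_BP = 1540672


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 969
print(protein_length(length_bp))  # 322
```

Both results match the Gene Length and Protein Length fields, which is consistent with the assumed inclusive convention.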


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGTG TTGTCGCGCA TGACTATGAA GCGGTGTATA CCACCGCTGC GGTTATTGAT 
GAGTCTGACC GAGGGCGACT CTGGATGCGA GGTCGTGATC GGGCGTCGCT GTTGCATCGC
CTCTCGACGA ACCATATCGC GCGACTTCAA CCCGGTCAGG GGACATTGAC GGTCCTCACG
ACGCCAATCG GTCGCATGAT CGACCTGCTA CGGGTGTATG CCCTTCCCGA TGCACTCTTG
CTGGAAACGG GACCGCGTCA TGGCGGGCCA ATCTTGCGTC ATTTGCGTAA AAATATCTTT
TTTAACGACC AGGTCACCGT TGCAGATGCC GGTAGTGAAT TGGGTCAGAT CGGTATCTAT
GGGCCGCAGG CGGGTGAGAT TGTGCAAGCT CTTGGTTTAC CGATGGTCGC GGAACGCTAT
GGGATCGTTG CTGCGCAGTG GGGTGAGACA CCGGTATTGA TCGCCCGTTG TGAGCCGCTC
GGTGGTGATG GCTATACCCT TTATCCGCCG GTAGCCCAAA CCGAGGCGTT GCTGGCTGCG
CTGGTTGCTG CCGGTGCTGC GCCACTTAAT GCTGAAACCG CTGAGGTAGT GCGTATCGAA
CATGGGTATC CACGCTTTGG GCATGAAATT ACCCTCGACT ACATTCCGCT TGAGGCCGAT
CTGTGGCGTG CGGTGAGTTT TCAGAAGGGT TGCTACGTCG GCCAAGAGAT TATTGCACGG
ATGGAGAGCC GGGGTCGGAT TGCTAAGCAG TTGCGCGGGT TGCGATTGAC GGCACTGCCG
ACAATCGTAC CGACTCCACT CACAGTTGAT GGTAAAGAAG TTGGTGTTCT CACCAGTGCT
GCCCACTCAC CACGATATGG TCTGATCGGG TTGGCGTATG TGCGGAGTAG TTACGCCGAT
GACGGTACAA CGGTGTTGGT TGCCGATCAA GTGGCAAACG TGTGCCGGTT GCCCTTTACC
GCTGAGTAG
 
Protein sequence
MNSVVAHDYE AVYTTAAVID ESDRGRLWMR GRDRASLLHR LSTNHIARLQ PGQGTLTVLT 
TPIGRMIDLL RVYALPDALL LETGPRHGGP ILRHLRKNIF FNDQVTVADA GSELGQIGIY
GPQAGEIVQA LGLPMVAERY GIVAAQWGET PVLIARCEPL GGDGYTLYPP VAQTEALLAA
LVAAGAAPLN AETAEVVRIE HGYPRFGHEI TLDYIPLEAD LWRAVSFQKG CYVGQEIIAR
MESRGRIAKQ LRGLRLTALP TIVPTPLTVD GKEVGVLTSA AHSPRYGLIG LAYVRSSYAD
DGTTVLVADQ VANVCRLPFT AE
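
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any processing. The sketch below is one possible way to do that and to translate the gene sequence into protein; it is not the tool that produced this record. It assumes the amino acid assignments of the standard genetic code, which, to the best of my knowledge, translation table 11 shares (the tables differ in which codons may act as starts). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Standard amino acid assignments, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # codons starting with T
    "LLLLPPPPHHQQRRRR"  # codons starting with C
    "IIIMTTTTNNKKSSRR"  # codons starting with A
    "VVVVAAAADDEEGGGG"  # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final line of the gene sequence above.
print(translate(clean("ATGAATAGTG TTGTCGCGCA TGACTATGAA")))  # MNSVVAHDYE
print(translate(clean("GCTGAGTAG")))                         # AE
```

The two printed fragments match the first block and the last two residues of the protein sequence shown above. Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field.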