Gene Cagg_1734 details


Gene Information

Locus tag: Cagg_1734
Symbol:
ID: 7269440
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2119935
End bp: 2121140
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 57%
IMG OID: 643566576
Product: type II secretion system protein E
Protein accession: YP_002463071
Protein GI: 219848638
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG4962] Flp pilus assembly protein, ATPase CpaF
TIGRFAM ID:
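
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS includes its stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS.

    Assumes the CDS length counts the stop codon, which does not
    encode an amino acid, so one codon is subtracted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(2119935, 2121140))  # 1206
print(protein_length(1206))           # 401
```

Both results agree with the Gene Length and Protein Length fields of this record.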


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.494543
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCAAACCT TATGGGACAC TAGAGTGCAG TCGGTGCCAT CACCGGCTAC CGGTGCAGCC 
ATCACGGAAG AATTACGGAT GTTGCTGCGC AGCGGTCGTC TGCGCGATTG CTGGCACTTA
CCACCTGATG AGGCGTTTGC TCGGCTCGGA TTCGATGACT CGGTGCCATG GCGCGATGTG
GCATGGTACG GTCCTATCGA GATTTGGCGT GATCCTGAGC ATGCGGTGTC AGACATTCTA
TTTAACGGTC CGTCCGATTC GCCCTTTTTT GTGGTGCAGC GCGGCATGAT GGTCAATACC
GGTGTAATCG TCCACCCGGC CTGGATCGAT TGGACGCAAC GTCAGTTGGT GCTACGTAGT
CACGGTGTGA TCGGCGATGC TCCACTGCCG GCATTCGTCC AAGGCGTTGT TGACAGGTTG
CGCTATGCCA TAACGAACCG ACGCGCTTCC CCATCTGGAC CGAGTCTGGC GATTCGCTTA
CTGCCCGAAC GGTGGGCAAC ACTCGACGAT CTTGTGCAGA GCAACGTCAT TAGTCGGGAA
GCCGGTGAAC TCTTATTGGC GGCTCTTAAC GGTGGTGCAT CAGTGCTGAT TGCCGGTCCG
ACCGGTAGTG GAAAGACAAC ACTAGCCGCC GCGTTGACCC AGGCGATTGG CACACGTATG
CGCTTGGTCG TCATTGAAGA TGGTGGGGAG CTGCCCCATA GCGCCAATAG TTTACATATT
GAAGCACCGG CTGAAACCGG TGGTTTTAGC CGTGCTGTGA CCTTTGCCCT TCGCCAAAAG
CCCAACTACA TCATCGTTGG TGAGGTACGT GGTGGCGAGG CAATGGCGAT GTTACAAGCG
GCCGCAACCG GTCATCCCGG TTTAGGCACC ATTCACGCGG CGACGGTACA AGGAGCGTTA
CGAAACCTTG AGCGGATGGC GCTGATCGGC TTGGCCCATG AGACAACCGG TGCCGGGCAG
GCAGCAGCTC AGATCGTGCG CGGTTTGATC ACCTCTGATG TCGTGAACCT GTTGGTAGTC
CAGATCGGAC GTGCTCCTAA TGGGAAGCGT GGTGTGATGG CCATCGAAGA GGTGTTACCC
CAAGGCTCAC AAGGTCAGAG TGGTGATCCT TTCCCAACAA ACCCACTTTT TCGTTATGAA
CGGACGAGTG AACAGTTGGT ACGGGCCGGC TATGTTAATG CAGGGTGGGG ATTGGGTCGG
ATGTAA
 
Protein sequence
MQTLWDTRVQ SVPSPATGAA ITEELRMLLR SGRLRDCWHL PPDEAFARLG FDDSVPWRDV 
AWYGPIEIWR DPEHAVSDIL FNGPSDSPFF VVQRGMMVNT GVIVHPAWID WTQRQLVLRS
HGVIGDAPLP AFVQGVVDRL RYAITNRRAS PSGPSLAIRL LPERWATLDD LVQSNVISRE
AGELLLAALN GGASVLIAGP TGSGKTTLAA ALTQAIGTRM RLVVIEDGGE LPHSANSLHI
EAPAETGGFS RAVTFALRQK PNYIIVGEVR GGEAMAMLQA AATGHPGLGT IHAATVQGAL
RNLERMALIG LAHETTGAGQ AAAQIVRGLI TSDVVNLLVV QIGRAPNGKR GVMAIEEVLP
QGSQGQSGDP FPTNPLFRYE RTSEQLVRAG YVNAGWGLGR M
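
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the pipeline that produced this record: it uses the standard amino acid assignments, which translation table 11 shares, and it treats an alternative start codon at the first position (here GTG) as methionine. The `START_CODONS` set reflects the table 11 initiators as commonly listed and should be checked against the NCBI genetic code tables; `translate` and its `is_cds_start` parameter are hypothetical names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (codons enumerated TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed initiator codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGCAAACCT TATGGGACAC TAGAGTGCAG"))  # MQTLWDTRVQ
```

The output matches the first ten residues of the protein sequence, including the leading M that arises from the GTG start codon.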