Gene Cagg_3655 details


Gene Information

Locus tag: Cagg_3655
Symbol:
ID: 7268190
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4445439
End bp: 4446683
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 55%
IMG OID: 643568461
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002464927
Protein GI: 219850494
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
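
The Start bp, End bp, Gene Length and Protein Length fields above agree with each other if the coordinates are read as 1-based and inclusive, and if the coding sequence ends in one stop codon that is not counted in the protein. The page does not state either convention, so both are assumptions in this minimal Python sketch of the check:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4445439
end_bp = 4446683

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in a single stop codon, which is not
# counted in the reported protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1245 414
```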


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00286404
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATG AGATGAAGCG CGTCGCCAGT CGTCGCCATT TACGGTACGG TTTGGCTTTC 
GAGCGCACCC AACGCCCGCT AGCGGCCCTC GAGCAGTTTC GCCGAGCGAT TGCCGCCGAC
CCAACGATGC GTGACGCCCA TAACGCACTG GCCTTCCATT ATCAGCAGCA AGGCTTGTTG
GCAAAAGCGG CCGACACGTT TGCGGCGGTC GCTACGCTTG CCGACGATTA CTTTGCCCAT
TTCAACCTCG GCTTCGTCTT GATCGAACTT GAACGGTACG ACGAAGCGGA ACGAGAGTTC
CAGCGTTGTC TTGAGCTTGA TCCGAATGAT ACGGCTGCAC AACTCGAATT AGCCTACATC
TACGCAGCAC GTGGTGAATA TGCCAAAGCA GTGACGATGC TTGAAAAGCC GCGCCAACGT
TACTACGATG ACTGGGCGGT TTTTCATCTG CTGGGGCGTT GTCTCTTTCA ATTACAACAG
TTTGATGAAG CACAACAAGC CTGGCAACAC GCTTTGGCAC TTGCTCCCCA CGCGGAAGCA
CAATTTGACC TGCTGGCGTG TTTGCAGAGT ATTGAGCGGC GGCGAGAGTT TCACACATGC
ACCGGGCTGA AAGATGACCT CTACGTACAC GAAGGTGTAA TCTGTCTCGG TTCGGCGATG
GATGACGGCA TCACCATCCA TCCACTACAC AATTATCACC TGCGTGAACA CGATATTGTG
CGTACCATCC AACGCTTCGT CGCACTTGCC CGCAGCAGCA AATGGCAGTT GGGAGGGATT
GTTGCCCCTG ATCTGACCAG CAAACCACTC GCCGCTGCAC TGGCTCAGCT TCTCGAGCTA
CCGCTGTTGA ACTTACACCA TCTCCCGAAT ATCACGACGC CGATCTTGCT CGTCATCGCG
GTTGGTCACT CAGCCGACCT CTTCACCATC GCCCGAGAAC GATTACCCAT CCCCAGTCCT
GGCTTCTGCC TCGCCGTCAA TTGGATGAAA CACAGTCCTT TGTGGCCTGA AGTAGTGGGG
GTAGTGGTAG AAGGTGTATG TACCGTACCG TGGGCAGATG AACTACGTGG CTTGTCTTCA
GCCGAACAAA TGACGGTGAT CCAGCGTATC GCCGACCGGC TGGCCGCCCA AGTACAATAC
GTATCCGGTG AAGAGAATCT ACCTCGCCAG ATTCGCTACT ATACCCGCCA CCATCGTCGG
TTGGCGATCA ACAGTCTACT ACCCACGACC AAACCACGGC TCTGA
 
Protein sequence
MTDEMKRVAS RRHLRYGLAF ERTQRPLAAL EQFRRAIAAD PTMRDAHNAL AFHYQQQGLL 
AKAADTFAAV ATLADDYFAH FNLGFVLIEL ERYDEAEREF QRCLELDPND TAAQLELAYI
YAARGEYAKA VTMLEKPRQR YYDDWAVFHL LGRCLFQLQQ FDEAQQAWQH ALALAPHAEA
QFDLLACLQS IERRREFHTC TGLKDDLYVH EGVICLGSAM DDGITIHPLH NYHLREHDIV
RTIQRFVALA RSSKWQLGGI VAPDLTSKPL AAALAQLLEL PLLNLHHLPN ITTPILLVIA
VGHSADLFTI ARERLPIPSP GFCLAVNWMK HSPLWPEVVG VVVEGVCTVP WADELRGLSS
AEQMTVIQRI ADRLAAQVQY VSGEENLPRQ IRYYTRHHRR LAINSLLPTT KPRL
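
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative only and is not the tool that produced this record. It builds the codon table from the NCBI ordering of the standard genetic code; translation table 11 assigns the same amino acids to codons and differs only in its permitted start codons, which this sketch ignores. The names `translate`, `first_gene_line` and `last_gene_line` are hypothetical, and only the first and last lines of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codons enumerated with bases in the order T, C, A, G (NCBI convention);
# the amino-acid string is the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
# The last line starts at base 1201, so it is still in frame.
first_gene_line = "ATGACCGATG AGATGAAGCG CGTCGCCAGT CGTCGCCATT TACGGTACGG TTTGGCTTTC"
last_gene_line = "TTGGCGATCA ACAGTCTACT ACCCACGACC AAACCACGGC TCTGA"

print(translate(first_gene_line))  # MTDEMKRVASRRHLRYGLAF
print(translate(last_gene_line))   # LAINSLLPTTKPRL
```

Both results match the start and end of the protein sequence listed above. Passing the full gene sequence to `translate` in the same way would cover the whole record.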