Gene Cagg_3396 details

Gene Information

Locus tag: Cagg_3396
Symbol:
ID: 7267136
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4122518
End bp: 4125745
Gene Length: 3228 bp
Protein Length: 1075 aa
Translation table: 11
GC content: 55%
IMG OID: 643568205
Product: TPR repeat-containing protein
Protein accession: YP_002464676
Protein GI: 219850243
COG category:
COG ID:
TIGRFAM ID:
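
The coordinate and length fields in this record are arithmetically related, and a short check makes that relationship explicit. The sketch below assumes 1-based inclusive coordinates and a complete, unspliced CDS ending in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4122518
end_bp = 4125745
gene_length_bp = 3228
protein_length_aa = 1075


def span_length(start, end):
    """Length of the closed interval [start, end] (assumes 1-based inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one terminal stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)               # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Both checks pass for this record: 4125745 - 4122518 + 1 = 3228 bp, and 3228 / 3 - 1 = 1075 aa.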


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCTCCC AGAGTACGAT CGGTCGCTGG AGTGTACGGG TGATTGAGGC TAGCTGGTTA 
TTAGCCCTGA CACTCGTGCC TATTTATTTT AACCTCTATT CGGCCCGCCA TTTTGAACCT
GACAAGGCGG CTGTGTTGCG CTCACTTGCG TTGATCGGTT TCGCCGCCGG GTTAATTTGG
TTACTTGATT GGCTTGGTCA GCGCCCTTCC AATACCCGAC CGGCGTCGTG GTGGACAACC
GTGCATGCCT CACCGCTCTT CTGGCCAACT GTGGCCTACA CGGCGGTATT TGCTTTTACA
ACGATAACGT CCATCACGCC CACAGTTAGC CTGTGGGGAT CATACCAACG CATGCAGGGG
TTGTACACCT ACCTTTCCTA CGTGACGTTA GGGGTGTTAG TCGCCGTCGC CTTACGCACA
CCGGCCCAAC GTGAGCGCTT TATCACAATA AGTCTGGCTG CTGCCACCGT CGTATCGGTG
TACGGGATCA TGCAGCATTA TCAGCTTGAT CCGTTGCCGT GGCGCGGTGA TGTGATTGCC
CGCGTCGCCT CAACGCTGGG GAATTCGATT TTCGTAGCAG CCTACTTGAT CATGATCGTG
CCACTGGCAC TCTACCGACT GGGTGCGGCT CTGGCCGCTG CGCAGGTCGC CCCAAGAAGC
GATAGGCCGG CAGCCGAATG GCGCTGGGCG TTGTCACGGG TTTCACTCTG GTTGAGCGGC
ACCCTCCTGA TTATCGCGAT CTTGAAGTTT AGTGTCGCAA TTCGGACAAT TGACTTTCGT
TATTGGTGGC TCTTCCCAGG AGCAATCATC TGTGCGACTG CCCTCTGGTG GCTACTAACA
GCTCGTCGTA GCCAAGCACT CCCACGCTGG CCACTGGTTC TTACCCTCCT GTATCTGCTA
GGCTTTGCAT TAGCGTTCGT TTTGAACGCA TCTGCCGGCG TACAGCAATT TGCATTACCC
GACATCGCAG CAAATGCGCT TGATTGGAGC GTATGGCTCG GATTAGCGAT AGCTGCCTTA
GTCATCGGCT ATGGGCTTTC CCTTATCGGT AATCCACCAC CCGAACCGTC ACGTTTGACG
TGGCAGCTTA CGGCTGTTGC CAGCGCGGGT GTGTTGATTC TTGCCTTGCT GACCATCTTT
TTCAGCCAGA GTCGCGGTCC GTGGATCGGA CTCGGCGCCG GACTATTTGT GTTCATCTCG
TTGATGCTGT GGTACGGGCG ACAACGGTTA CGGGCAAATG GCAATCTTAG CGGTAGTCGT
CTGCTGACTC AGGCATTGGC TGGATGGGTA GCGTTGATCG TCCTGATCGG TGGGTTTCTG
ATCGTGTTTA ATCTGTCTGA TGCTCCCATC TTTACCCAAA TGCGCGAAAT TCCGTACCTC
GGTCGTATGG GTCGACTGCT GGAAGTGGAT AGCGGTACCG GCTTAGTACG GCGGTTGATT
TGGGTGGGAG ATGAGCACGG GCGTGGAACT ATCGGTCTGA TCACGAGCGA TCCACTGCGC
CTCCTCATTG GGTGGGGGCC GGAGAGTATG TTCGTCGCAT TCAATCGTTT CTATCCACCG
TCGCTGGCAC ATGTTGAAGC GCGTGGCGCT TCCCCCGATC GTTCACATCA GGCATTGCTC
GATGAGCTTG TGACAAAAGG CTTGCTTGGC CTCGTTACCT ACCTCTGGTT GATCGGTAGC
ATAGTCTGGT TTTGTCTTCT TCAATTGCGA CGACCGACGA GCTGGCAGTA CCAACTGTGC
ATCATTGGTA TTTTGAGCGC AATCACCGCC CATGTGATCG AAGGGCTAAC CGGTATTCCC
ATTGTGGCAA CCCTGATGAT GTTTTGGTTG TTGATAGGGT TGGCTATCGC AGCAGAGCGC
ATCGAGCACG GGCATGCCCA ACGACCGGCC ACAATCCCTG AACCGGTCGT GGCGCAGCGG
GCCGAACGCC CGGCTACGCG CCGAAATCAA CCGCGCCCGC CAGTACGTCG TCCGTCCAGT
CGCCGTCCGG TTACCGGTAC GATAGGCACC GCCGCTCTGA TCGGTCTGGT TACTGCCGGC
CTGATCTGGT GGCTGAACAT ACAGCCCATT TACGCCGACA TGCGCTTTCA ACAAGGTCAA
AGCTATAGCG ACCAGGGTCA AGTTTCGATC AGCACCCAGA TCGCCACCCT CAACGAATAT
ATCGCCACCA TCCGGGCTAA TCCGGGTGAA GACTTTTACT ACTTACACCT TGCCCGTAGC
TTAATGTCAC TTGCCGATAC GCTTCGTCGC CAGGGTGTCA ATCTTGGTGA GGCCGGTCAA
CCTCGTCTTG ACGCACTGTT ACGCCTCGAT GGGATAGAGG CGGCGACCGG ATTCGTCCAG
CGTTCTTCGC CGTTAAGTTT ACTCGCCTAC GCAGAGGCAA CATTACAGCA CGCTCATCAG
CTTAGTCCAC TCAATAAAGA CCATTACGCA AATTTAGGCA GAATCAATCT ATTCTGGTAC
AGTTGGACAA ACGACGTGCA ACGATTATAT ACAGCGCTCA AGTGGTACGA GCGCGTCGCC
GAGATTGCCC CTCAAGATGT TGCACTGATG AACGAGCGGG CCGGTGTTCT GATACAACTG
GCCGAATATG CTACTATCAG CGGTGATACT GCACAAGCCG GTGCATTTTT TCAACAGGCC
GATGAGTTGT TGCAGACTTC GGCTCAACTC GATCCACGCT TCGGTGACAC GTCTCTCCGA
CGCGGTGATC TGATCCGTCT TCGCAGCGGT GATCTCGATA CGGCCACGAC ATTCTACCTC
CGTGCTATCG AGCAAGCACC TCAGCAGATG GTTGACAACC TTGATCGGAT TACTCGCGCG
TTGAACAGTC GCCCCGATCT GCTTAACCAA CTACGTAACG CTTTCACCGT ACAAGCGACG
CGAGCAGAGC AGGTGTTAGC CGAAGCGCGG GAAAAACCGG AACGGGCATT CGAGCTACCC
ACCCTTGAAA CACAAGCTGC AAACCTGTAC GCAGCCGTCG CCCGCCTTGC TGTGCAGACA
AACGATCTTG CCGGCGCCAT TGAGCCGTAT GCACGGGCAG TGTCGATCCA ACCGGCAAAT
GCTGCGTTGA GCGAGCAGTA TACGTTAGTT TTGAGTGAAA CACTGCAATA CGACGCCGCC
CTCACCGAAG CGCGTCGCTT GCTGGCGGTT TTGCAAAGCA ATGGGCGTAC CGGCGAGATG
GCGCGGATCG AGCAGTTGAT TGCGCTGATC GAACAGGTAC GAAAATAG
 
Protein sequence
MSSQSTIGRW SVRVIEASWL LALTLVPIYF NLYSARHFEP DKAAVLRSLA LIGFAAGLIW 
LLDWLGQRPS NTRPASWWTT VHASPLFWPT VAYTAVFAFT TITSITPTVS LWGSYQRMQG
LYTYLSYVTL GVLVAVALRT PAQRERFITI SLAAATVVSV YGIMQHYQLD PLPWRGDVIA
RVASTLGNSI FVAAYLIMIV PLALYRLGAA LAAAQVAPRS DRPAAEWRWA LSRVSLWLSG
TLLIIAILKF SVAIRTIDFR YWWLFPGAII CATALWWLLT ARRSQALPRW PLVLTLLYLL
GFALAFVLNA SAGVQQFALP DIAANALDWS VWLGLAIAAL VIGYGLSLIG NPPPEPSRLT
WQLTAVASAG VLILALLTIF FSQSRGPWIG LGAGLFVFIS LMLWYGRQRL RANGNLSGSR
LLTQALAGWV ALIVLIGGFL IVFNLSDAPI FTQMREIPYL GRMGRLLEVD SGTGLVRRLI
WVGDEHGRGT IGLITSDPLR LLIGWGPESM FVAFNRFYPP SLAHVEARGA SPDRSHQALL
DELVTKGLLG LVTYLWLIGS IVWFCLLQLR RPTSWQYQLC IIGILSAITA HVIEGLTGIP
IVATLMMFWL LIGLAIAAER IEHGHAQRPA TIPEPVVAQR AERPATRRNQ PRPPVRRPSS
RRPVTGTIGT AALIGLVTAG LIWWLNIQPI YADMRFQQGQ SYSDQGQVSI STQIATLNEY
IATIRANPGE DFYYLHLARS LMSLADTLRR QGVNLGEAGQ PRLDALLRLD GIEAATGFVQ
RSSPLSLLAY AEATLQHAHQ LSPLNKDHYA NLGRINLFWY SWTNDVQRLY TALKWYERVA
EIAPQDVALM NERAGVLIQL AEYATISGDT AQAGAFFQQA DELLQTSAQL DPRFGDTSLR
RGDLIRLRSG DLDTATTFYL RAIEQAPQQM VDNLDRITRA LNSRPDLLNQ LRNAFTVQAT
RAEQVLAEAR EKPERAFELP TLETQAANLY AAVARLAVQT NDLAGAIEPY ARAVSIQPAN
AALSEQYTLV LSETLQYDAA LTEARRLLAV LQSNGRTGEM ARIEQLIALI EQVRK
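
The protein sequence above is the translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming a plain forward-frame reading that stops at the first stop codon. The table 11 codon-to-amino-acid assignments are the same as the standard code; the table's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. `translate` and `CODON_TABLE` are hypothetical names used for illustration only.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 0, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence listed in this record.
first_blocks = "ATGTCCTCCC AGAGTACGAT CGGTCGCTGG"
last_blocks = "GAACAGGTAC GAAAATAG"

print(translate(first_blocks))  # MSSQSTIGRW
print(translate(last_blocks))   # EQVRK
```

The two fragments reproduce the first ten and last five residues of the listed protein. Passing the full gene sequence, with its spaces and line breaks left in, would translate the whole CDS the same way.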