Gene Cagg_2566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2566 
Symbol 
ID7267155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3125357 
End bp3127417 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content59% 
IMG OID643567390 
Producthypothetical protein 
Protein accessionYP_002463871 
Protein GI219849438 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.359995 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGTC GCGAATTTCA ACCGGTCTCG ACGACGGTAG GCGGTGAGCA ACTGACTACC 
AGCCGACTGA GTGCCGGTCG GCGTTGGTTG CTCAGTGGTT TGGTCATCGG CTTGCTCGTT
GCTATGACGA CAACCGGGCG GGCTGTGTTA GCGACGGCCT TCTGGCTCAT TGCGCCCGGC
TACCTGATCG AACGCTATCT TCCCGGTGCG CGTCCGCATT GGTTGCTACG GATTGCGCTC
TGGGTCGGGA TTGGTTTGAG CATCCTACCG CTAGTTTACC TATGGACAAC CGCCATCGGT
GGAGCGGTAA CGCCCTCTCT GCTAATCGGC GGTGCATTCC TGTTGGCGAT AAGCACCGGC
GTGGCCGCCT GGCATGATCT CGCAGCCGTA CACGATCGAC CATCGACCGC GACGTGGGTG
TTGCTTACGG TACTCGTGAT CACCGCCGGG TTGCGTGCTG TGCAGATCGC CGATCTCGCT
TTTCCACCGT GGGTCGACTC GGTGCATCAC GCCTTGATGA TCCGGGTTGC AGCCGAGACA
GGGCGAGTTC CGTGGACGCT GACCCCCTAC TTGCCGGTAG TTGACATGCC GTATCACTGG
GGGTATCACG TGTTGATAGC CGCAGCGTGG CAGCTTAGCG GTGGTGAATC ACCCGATGTA
ATGCTTTGGA GTGGCCAATT CCTCAACTGG TTGCAGAGTG TGACGGCGGC AGCGTTGGCC
TTGCTGTATT GGCGACGACC GGTGGCGGCG AGTGTCGCTG CCGTTATCGT TGGCTGCATT
TCGTGGATGC CGGCTTACTA CGTATCGTGG GGTCGTTATA CCCAACTAAG CGGGTTACTG
TTGTTGGTCG GGTTGGCAAT CAGTTGGCAC TATTGGCTAC ATACCGGCGA CAGACGCGAG
CGGATCGGCT GGCTGGCCCT GATCATTCTC ACGGCTACCG GTCTCAGTCT CGTGCATATG
CGAGTCTTTG TGTTCGGCGG GGCGCTGATT GCCGCGGAGA GTCTCGTGTG GTGCATGCGT
CAATCACGGC GCACCATCAG CATCTACGCG GGTCTCGCCG GGCTGGTCGG GGTTGGTGTT
GCGCTCCTGG CAGCACCGTG GTGGTGGTTA ATCCTCCGGC GCATCTTATT ACCGGCAGCA
GTCGGCGAGC AGAGTTTAAC CACCGGCGGT TCGTATGCCC GACTCAGCTA CGACTTGATG
TGGATTGGCC CGAACGAGTG GTTGGTCGCA TTGGCGTTGC TCGGCGCACT GCTCGGCGTT
GCTCGGCGGC AACGGGCGAC TACCATCCTC ACGTTGTGGG TAGGTGGTTT GGTGGTGCTG
ACCAATCCGT GGCTGATCAG CTTTGTGTTG CCATCGGTAG GTGTGATTGT ATTGATTGGC
AGTGTTGTGC AGCGCCGTTG GCGTTGGGCT GGGATTGGGC TGCTCTTGCT GGCGATTAAC
CCGGCGACAG TACATCTGCC CTTTCTCTGG CTACTCCCGG TTGACATTGC TGCGATCAGT
CTTTTTTTAC CCCTCAGCAG CCTGATCGGC GGTGGGGCCA CATTGATCTG GCCGTCCCGA
CGCTGGTTGC AGATCGGTGG TGCTTGTGCT TTGATCGGGG TTGCCCTGTG GGGTGCGCAG
CAGCAACGCA ACATCGTGAA TCCGATCACC GTGTTGGCTA CTGCTGACGA CCGCGTTGCT
ATGCGGTGGA TCGGTGAACA TACTCCTACA GATGCACGCT TTCTCATCAA CGCAGCACCG
TGGCTACCGA CCATCGAGCG CGGCAACGAT GGTGGTTGGT GGATTACACC TCTCACCGGG
CGGTGGACAA GTACGCCGCC GGTTCTGATC ACCTACGGCG ACCCAACCGC CTTGCGCGTA
GCGCGTCAGC TCAGCCAGCA AGTGATTGCG ATTGGCAACG GGCAACCGAT TGATCTGGTT
CAACTGGTCA CCGATGCTCA GATCGACTAC ATCTACACTT CGCCGGCCGG CCCACTAACT
CCGGCACAGA TTGCCGGCCT GGGTTATGAG CCGGTATACA CACAGGGTGG TGTTGTGATT
TATCAGGTGC GAGCACGGTA G
 
Protein sequence
MNGREFQPVS TTVGGEQLTT SRLSAGRRWL LSGLVIGLLV AMTTTGRAVL ATAFWLIAPG 
YLIERYLPGA RPHWLLRIAL WVGIGLSILP LVYLWTTAIG GAVTPSLLIG GAFLLAISTG
VAAWHDLAAV HDRPSTATWV LLTVLVITAG LRAVQIADLA FPPWVDSVHH ALMIRVAAET
GRVPWTLTPY LPVVDMPYHW GYHVLIAAAW QLSGGESPDV MLWSGQFLNW LQSVTAAALA
LLYWRRPVAA SVAAVIVGCI SWMPAYYVSW GRYTQLSGLL LLVGLAISWH YWLHTGDRRE
RIGWLALIIL TATGLSLVHM RVFVFGGALI AAESLVWCMR QSRRTISIYA GLAGLVGVGV
ALLAAPWWWL ILRRILLPAA VGEQSLTTGG SYARLSYDLM WIGPNEWLVA LALLGALLGV
ARRQRATTIL TLWVGGLVVL TNPWLISFVL PSVGVIVLIG SVVQRRWRWA GIGLLLLAIN
PATVHLPFLW LLPVDIAAIS LFLPLSSLIG GGATLIWPSR RWLQIGGACA LIGVALWGAQ
QQRNIVNPIT VLATADDRVA MRWIGEHTPT DARFLINAAP WLPTIERGND GGWWITPLTG
RWTSTPPVLI TYGDPTALRV ARQLSQQVIA IGNGQPIDLV QLVTDAQIDY IYTSPAGPLT
PAQIAGLGYE PVYTQGGVVI YQVRAR