Gene Cagg_2605 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2605 
Symbol 
ID7267196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3187128 
End bp3190076 
Gene Length2949 bp 
Protein Length982 aa 
Translation table11 
GC content57% 
IMG OID643567431 
Producthypothetical protein 
Protein accessionYP_002463910 
Protein GI219849477 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.715518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.847276 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTATC GCCTTCCCAC CCGATCTAGG ATAGTGTTCG GCATCATCAG CCTAACACTG 
CTGATCATAC TCACACCGCT TTTGGCATGG ACAGCCCCGA CCCAAACGCA GGTTGTTTCA
CTTTCCACCC ATCGCTACGA TCTCCAGATC ACTGCCGGCA GCACGCTCGC ATGGCGCAAC
GACAGTTCCG ACTTCCACCG GCTACGGGCC ATTGACGGCA GTTGGGAGAC GCCGCTCATG
CGGCCCGGCG AGGTCGTGAC CCAGACCTTC ACCGTTAGCG GCCAATCGGC ATTTTTGTGC
GATATTGACC CAACGATGCG CGGTACGGTC AGCGTGCAGG CAGCCCATAC CGTATTCGTG
CCGATGGTGG CGAGTCAAAC GTTTGAGCGC TGGTCGCAAC CGCAGACGTG GGGTGGTCGT
TTGCCTCAAG CAGGCGATGC GGTGCAGATT CCCGCCGGCA AAGTGATCTT GCTCGACGTT
AGCCCACCGC CATTACAGAG TTTGCTCATC GAGGGCGAGC TGATCTTTGA TCGGCGTGAT
CTGGATTTAA CCGTCGGCTG GATTATGATC CACGGCCAAG GCCGCCTACG CATCGGCAGC
CCCACCGCGC CATTTGCGCA ACGAGCGACG ATTACCCTGA CCGCAACCGA TCCGAATGAA
AATGTGATGG GGATGGGCAC GCGCGGCATC TTGTTGATGG GTGGATCGTT TGAGGCGTAT
GGCGTAACGC CAAACCATCC GTGGACGGTC TTGAACAATC ACGCGGCTGC CGGTACGCGC
GAACTGATAT TGCGCGACAC GGTTGACTGG CAGATCGGCG ATCAGGTAGT GATTGCGCCG
ACCGACTTCT TTGGCGTAGC CCAAACCGAA CGCCTCACCG TCGAGGCGGT TGACGGCACA
CGGGTGCAGG TAAGTACACC GCTGCAACAA GCGCGGTGGG GCCGCTTGCA GTACGTGAAT
AGCAGCGGGA TGACTCTGAC GCCGACGAAC GAGGTAACAC CGCTCGTCCT TGATGAACGG
GCCGAGGTAG GCAATCTCTC GCGCCGGATT GTGATTCAGG GGGCCGACGA TGATCGCTGG
CGTAATGACC GTTTTGGCGC CCAGATCATG GTCATGAACA ACGCCAGCCT ACGGCTCGAT
GGTGTAGAGT TACGACGGGT CGGACAAGGT GGTCGGCTGG GTCGTTACCC GATCCATTTT
CATTTGTTGT CGTATGATGC CGACGGCAAC TGGACCGGTG ATGCCACCAA CAACGTGATC
ACCAATTCGA GCATCTGGAA CTCGGTCAAC CGCTGCATTG TCATTCATGG CACAAACGGC
ACTACCATTC GTAACAACAT CTGCTATGAC ATCGCCGGTC ACGCTATCTT CCTCGAAGAT
GCGGTCGAGC GGCGCAATGT CATTGAGGGG AATCTAGTGC TGCGCGTGCG CCAACCGCCA
CAGCCGCTAA TCGCCAGTGA CCGGAGCAGT TTTCGGCGGG GGCCGTCTGG GTTCTGGCTG
ACCAACCCTG ATAACACGGT ACGGGGCAAT GTTGCCGCCG ATACCGAGGG CAATGGGTTT
TGGCTGGCTT TCCCCGATCA ACCACTCGGC GCGAATAAGC GTGTACCGAT ACGGCCGGTT
CATCTCCCGC TTGGCATCTT TAGCCACAAC GTCGCCCATT CTAACAGCAA ACCGGGCATT
AACATCGATT TTGCTCCGTT TGACGATGAA GGTAATACAA AAGAAATCAA ATACATCCCA
ACCGTTAACG GTGAGCCATT CCGTTACGAG AACCGCGTGC GCTTCACGCT GAGCGACATT
ACAACCTACA AAAACAATGA TAATGGGTTG TGGAATCGGG TGTCGTGGCC CGATTATGTA
CGGTTTGTAT CGGCAGATAA CGTTGGGATG TTCTTTGCCG GCGCCGGTGA TAGTGGCAAG
ATTGTTGATT CGCTGATCAT TGGGGAGAGT TTGAACAACC AATCACCGCG ACCGACCACC
GATCAGCCCA ACACAGCGGT TGCCAGTTAT CACAGTACGT TCGACATTGA CAACAACGTC
ATCGTGAACT TTCCACTCCA CAACCGGCTC GACCGGGCCA GTGGTGCGTT TGCAACTAAT
GATTACTACA CTCGTGCAGT TGATCGCGGC TTGATTCGTA ATCCGCACAA CCGCCTGATC
AATGCCCATC CGGGTCGGCG AGTTATCTCA CCGAACATCA ATACACCCGC CGGCAATGCA
GCGTTAGCCG GCGCATTGTG GGACCCGCAC GGTTATTGGG GACCGGCCGG CAATTATTGG
GTGTACGACA TTCCTTTCTT AACGGTGGGG CAGACGTGTG TCGCCGTCGC ACCGGTCGGC
CAGAATGGGC AGAGTTGTCG CGGGCCGTAC TATGGTGTGG GTGGATTGCG CATCGACAAC
GGCGATCCGT ATAAGCCGCG GATGCCACTC ACCGTTACCC GGCTCGATGC AACCAATCAA
CCGATAGCAC AGTGGATCGT CACCGAAGGG AGGGGGAGCG GCACCAATAC CTTTGGCATT
ATGCCGTGGA TGCGTCACTT CACCGCAGTC ACCGGTGGGC GGTATCGAGT CGAGTTCCGT
GATGGAACGA CGACTACCCC ACTACCGCTG CAAGAGTTGA AGATCACGCT GAGCAACATG
CACACCACAG ATGATCGGCT TATCTTGGCG CTTCCCTTTG GTGGCAGCGG AACGGTAGAA
GCGTATCTGA CCACGCGCGA AAACTATCAA GATTCGGCAC CGGGTGCCGC CGAGCGGCGC
GATCTCACAC CGGTTACGTC GTTCGCGGCG TTGGTCGCCA CCGACAATAG CTTTTGGCAC
GATACGGCCT CGCAACAGGT ATGGGTGAAT GTACAGGGTG GCGTACCGAG CTGGAACGGC
GCACCGCTCG ATCCGTTATC CGATACGGCA CTCTATCGAG AGACGATGTT GCGGATCTAC
CGTCCCTAA
 
Protein sequence
MHYRLPTRSR IVFGIISLTL LIILTPLLAW TAPTQTQVVS LSTHRYDLQI TAGSTLAWRN 
DSSDFHRLRA IDGSWETPLM RPGEVVTQTF TVSGQSAFLC DIDPTMRGTV SVQAAHTVFV
PMVASQTFER WSQPQTWGGR LPQAGDAVQI PAGKVILLDV SPPPLQSLLI EGELIFDRRD
LDLTVGWIMI HGQGRLRIGS PTAPFAQRAT ITLTATDPNE NVMGMGTRGI LLMGGSFEAY
GVTPNHPWTV LNNHAAAGTR ELILRDTVDW QIGDQVVIAP TDFFGVAQTE RLTVEAVDGT
RVQVSTPLQQ ARWGRLQYVN SSGMTLTPTN EVTPLVLDER AEVGNLSRRI VIQGADDDRW
RNDRFGAQIM VMNNASLRLD GVELRRVGQG GRLGRYPIHF HLLSYDADGN WTGDATNNVI
TNSSIWNSVN RCIVIHGTNG TTIRNNICYD IAGHAIFLED AVERRNVIEG NLVLRVRQPP
QPLIASDRSS FRRGPSGFWL TNPDNTVRGN VAADTEGNGF WLAFPDQPLG ANKRVPIRPV
HLPLGIFSHN VAHSNSKPGI NIDFAPFDDE GNTKEIKYIP TVNGEPFRYE NRVRFTLSDI
TTYKNNDNGL WNRVSWPDYV RFVSADNVGM FFAGAGDSGK IVDSLIIGES LNNQSPRPTT
DQPNTAVASY HSTFDIDNNV IVNFPLHNRL DRASGAFATN DYYTRAVDRG LIRNPHNRLI
NAHPGRRVIS PNINTPAGNA ALAGALWDPH GYWGPAGNYW VYDIPFLTVG QTCVAVAPVG
QNGQSCRGPY YGVGGLRIDN GDPYKPRMPL TVTRLDATNQ PIAQWIVTEG RGSGTNTFGI
MPWMRHFTAV TGGRYRVEFR DGTTTTPLPL QELKITLSNM HTTDDRLILA LPFGGSGTVE
AYLTTRENYQ DSAPGAAERR DLTPVTSFAA LVATDNSFWH DTASQQVWVN VQGGVPSWNG
APLDPLSDTA LYRETMLRIY RP