Gene Cagg_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2026 
Symbol 
ID7269185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2486508 
End bp2487914 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content54% 
IMG OID643566861 
Producthypothetical protein 
Protein accessionYP_002463350 
Protein GI219848917 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCCGA TTTGGGAAGG TATCTTCATC CATTCGGCCA TGTACGTGGC GATTGTTTCG 
GCATTCCACG TACTTGCCTC ACACCTGACC GTCGCTGCGG CATGGTTCAA CCTCTATCTC
GAACGACGGG CGGTTTATGA AAAGCGTCCT GAACTGTACG AGTACTTACG CCGGAGCGCA
TTAGGCTTGC TCGTCTTTGC GTATGTCTTC GGCGCGATGG CCGGCGTCGG TATCTGGCAA
ACCACTACCG CAGCGAACCC ACGCGGTATT TCGACGCTTA TCCACAATTT CGTCTTCTAC
TGGGGAGCTG AATGGTACAT GTTTTTGATT GATGTCGTTG GTATTATCGC CTACTACTAT
TCGTTCGGTC GGATTGATCC GAAAACACAC CTACGGTTGG CATGGATTCT CGCCCTTGGC
GGTACCGGCA CATTATCTAT CATCGTAGGA GTTTTATCAT TTAAGCTCAC CCCCGGTCTG
TGGTTAGATA CCGGTGTGAG TCTGAACGGT TTCTTCAATC CCACCTTCTG GCCACAAATC
TTCCTGCGCT TTGCGCTTAT GTTCCCGATT ACCGCAGCGT GGGCGCTGCT CATTGTGACT
GGCATGCCCA AAACATACCC CGAGCGCGAA CCGATTATCC GCAATGCGGC CCTGATGGGT
TTGGGCGGCT TGGCGGTCGC GCTGGCTATT TTCGTCTTCT GGTTCTACCC AGTGCTGCCC
GAGCACGCCA AAATTATTAT GCGAACCCGC GCCATACCGC CGATTACCTA TACCGTTATT
CTTGGCGGAA TCGCCGCGAC GTTTGCCGGT CTGCTGTTTG CGTGGCGGTT TCCCCAGCGC
CAGCAACGCC TGATTGCGTT GGGTGCGCTG TTCGTGTTGT TTGCCGCGAT CTTTGGCGCC
GAACGCACCC GCGAAGTCTT ACGCAAACCC GATATTATCG CCGGCTATAT GTCATCGAAT
CAGCTCGTTT TCAACGATCT GCCAGCCCGT AGTATCCAGA GTGAAGAGCA GCGGCTCAAT
GAGACCGGTA TGCTGGGGTC GTTGCCATTT CTCCCTCGGC CTGACCAGAT CGTGCTGCCG
GCCAATAGTG GATTGCCCAA TCAGACAATT GCCGTCGGAC GCACCTTAGT CATGCAGCAG
TGCGCCTCGT GCCACAATGT GAGCCAGCAA ACGGCATTGA TTGGGTTCAA CCAACGCTTG
GCGTTACGGT CACTGGCCGA TCTGTTATAC CTCCGTCGAG CCACGACGGC TGATCTGATC
AAGTCGCGCA TCCGCGCAAT TGGCGGGTTC CAGTATATGC ATCCGGTGGT CGGCACTGAA
GAAGAACTTT CCGCTATGGC TCAATATCTC GAATACTTTG TCCAGCAGGT GCATCCGTCG
CAACCCCAAG TAGTCACGCA GAGGTGA
 
Protein sequence
MYPIWEGIFI HSAMYVAIVS AFHVLASHLT VAAAWFNLYL ERRAVYEKRP ELYEYLRRSA 
LGLLVFAYVF GAMAGVGIWQ TTTAANPRGI STLIHNFVFY WGAEWYMFLI DVVGIIAYYY
SFGRIDPKTH LRLAWILALG GTGTLSIIVG VLSFKLTPGL WLDTGVSLNG FFNPTFWPQI
FLRFALMFPI TAAWALLIVT GMPKTYPERE PIIRNAALMG LGGLAVALAI FVFWFYPVLP
EHAKIIMRTR AIPPITYTVI LGGIAATFAG LLFAWRFPQR QQRLIALGAL FVLFAAIFGA
ERTREVLRKP DIIAGYMSSN QLVFNDLPAR SIQSEEQRLN ETGMLGSLPF LPRPDQIVLP
ANSGLPNQTI AVGRTLVMQQ CASCHNVSQQ TALIGFNQRL ALRSLADLLY LRRATTADLI
KSRIRAIGGF QYMHPVVGTE EELSAMAQYL EYFVQQVHPS QPQVVTQR