Gene Cagg_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2025 
Symbol 
ID7269184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2485305 
End bp2486504 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content55% 
IMG OID643566860 
Producthypothetical protein 
Protein accessionYP_002463349 
Protein GI219848916 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTCG GTGAGATGCT CAAAATGATG CGCGACCCGC TCGGCGCGCC GTTTTATCCG 
CCAGCGTTGC AGGTGTTATT GGTCGTAACG TGGGTGTTGC ATATCTTCTT TGTTACGCTC
GCCTTGGGAT CAAGTTTGTA CGCGATTTGG GGCTTTCTCC GTCCTACCGA CTACCGATTA
CGGCTGGCAC GGGTCGCTGC CCGGCTCACC CCAAATGCGG TTGGGCTAGG GATTGTTACC
GGCATTGCAC CTCTGCTGTT TGTGCAGACG ATCTACGATC CAATTTGGTA TGCCAGCAAC
TCTTTGACCG GGTTTTGGTC GGTCAGTTTC ATCTTTGTGG TCATGGGCGG CTACAGTCTC
GCATACCTGT TCTACCTGAA GGGCAGCCCT GACGGGAAAT TGCTTTGGTC GGCGGTGGCA
TCGTTCATCT TACTCTTCTT CGCCGGCTGG ATTATGCATG TGCTGGCGGC AGTCTCGATC
CGACCTGAAC GCTGGATGGA GTGGTATGCG CCGAATGGCA TTATTGATAC ACGCGGAATC
ATCTTCCATG CGTGGAACAT TCCGCGGCTT GTGTTTTTGT TGCCCTTGCA AGCCTGCCTG
AGCCTTGCGG TGACGCTGAC TTTGTTCGGT TGGTATTTCC GTCGGAGTGA GGAAGATGCA
CCTTTTATAC AATGGGTGGC CAACCTTGGT CGGAAACTAG GCCTTGTGAT TAGCCCGATC
TACGCGCTTG CCGGCTTGCT CTGGGCGATG ACCGAAGGTG TTGAGTTCGG TATCGGTTGG
CAAGTCGGGA TCACGTTAGT GGGCATCGGA GTAGCACTGA CCGGCTATTT CTTCTGGCTG
CGCCAACCAA TCCGCCATGC GCCGCGCACG TTGCTCGTTT GGATAGGAAC GCTGGTTGTG
GTAGGTATGG TACGCGAAGC GATCCGGGTT GTCTCACTAG CACGATTCGG GTACAGTGTC
GCTACCTACC CCTATGCCTT CGATTGGGGA TCGATTATCG TATTTACCGT AACCACCATC
GTCGGCGTCG CAGTACTCGC GTATCTCATA ATGGTGATGT ACCAGTCTGG TGGGGTGAAG
CGTGATGCGC AGATCTCTCC CCGTGTGGAG CGGCTTGGTA CGATTGCTAC CGGTATGTTA
GGGGCGTGGT TTGGCTTCTT CCTGCTCGTT GGCTTGTATG CCACCTTCTT GCTAAGGTGA
 
Protein sequence
MNVGEMLKMM RDPLGAPFYP PALQVLLVVT WVLHIFFVTL ALGSSLYAIW GFLRPTDYRL 
RLARVAARLT PNAVGLGIVT GIAPLLFVQT IYDPIWYASN SLTGFWSVSF IFVVMGGYSL
AYLFYLKGSP DGKLLWSAVA SFILLFFAGW IMHVLAAVSI RPERWMEWYA PNGIIDTRGI
IFHAWNIPRL VFLLPLQACL SLAVTLTLFG WYFRRSEEDA PFIQWVANLG RKLGLVISPI
YALAGLLWAM TEGVEFGIGW QVGITLVGIG VALTGYFFWL RQPIRHAPRT LLVWIGTLVV
VGMVREAIRV VSLARFGYSV ATYPYAFDWG SIIVFTVTTI VGVAVLAYLI MVMYQSGGVK
RDAQISPRVE RLGTIATGML GAWFGFFLLV GLYATFLLR