Gene Cagg_2198 details

Gene Information

Locus tag: Cagg_2198
Symbol:
ID: 7266771
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2692649
End bp: 2694154
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 57%
IMG OID: 643567029
Product: protein of unknown function DUF333
Protein accession: YP_002463517
Protein GI: 219849084
COG category: [R] General function prediction only
COG ID: [COG3042] Putative hemolysin
TIGRFAM ID:
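
The coordinate and length fields above are related by simple arithmetic, which the following Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. Both are assumptions, not statements from the record, and the function names are illustrative only.

```python
# Values copied from the gene record above.
START_BP = 2692649
END_BP = 2694154


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, "bp;", protein_length(n), "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.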


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAGCCA GCCTACATCG CACGCTTACG CTGGTCTTTG CCATTGTCTT GGTAGGTCTC 
AACACCGGCT GCGTCCCTGT TCCCAATTCA GCGCAACCCA CAACCATTGC ACCAACCACG
ACAGCACTTC CGTCCCGACC ACCCACTGCA ACCCCGGTGC CAACCGCGAT GACCGGCCTC
GCGAATCCGG CTTCGGTCTA TTGTAGTGAG CAGGGTGGTT ATCTTGAGAT GCGGACTACT
GGTGATGGTG GGCAAATCGG GGTTTGCCTT TTTGCCGATA ACAGCCTCTG CGAAGAATGG
GCATTCTACC GTGGTGAATG TCGCCCCGGT GAGCAGTATG ATTCGACGAC GATACGACCT
GACCCGACCG GCATCCCCGC ACCGATCGGC GAGCTACTGG CACTCTTCCG AGCTAACCTA
CCGGCAAATG CCTTCAACGA CCTGGCTGCT CAACCGGTAC CAACCACCGA CGGCAGTCAA
CTCTGGATCG TCTACAGTAC CGGTATGCGC AATTTCGACC TTAACCCGCT TGTCCCGCAT
ACGCTAGCAC TTTACACCTA CACCGATGGT CGCTGGCAAG AACGGGGACG AACTACGCTG
AGTACAGAGT CATTCACCGA TGGACCAGAT TTTGTGGGGA GTGTACAACA AGTGCAGATC
GCCCCTGGGC GCATCTGGTT GCAGATTGAA GGTGGGATCG GTGCGCACGG TGGCAGTTAT
CATCTCCTGA GTTTCGACGG TACTGAGTTA CGAACCGAAG TGGCCGCCTT CTCGCCCTAC
CCCGGTTTTG GTCATACCGA GGATCTCGAC GGCGATGGAG TCCGTGAGGT TGTCCTTAAT
CGTTCCGAAC CGTACATCTT CTGTTATGCA TGCGGCGTCT ACTACCCGGC GTATCAGGTC
TACCGATGGC AAGATGAACG CATGGTCGCA TTACAGATTA GTGACCTGAC AGATGGGCAA
ACCGAACCAT TCGCCGATCT CAACCGGCAA GCGATCACCT CAGCGCAAGC CGATCTATGG
GCCGATGCCT TAGCGGCGAT CAATGCAGCC GTCGCACAAG CCGGTACCGC CGATCCGACC
ACGCAAGCTG GCACACTGCG GTGGAATCAG CGTCTGATCC AGATGACGCA TACAGCGCAC
ATGAACGCAA TCGCTGAGAG TGCTTACCCG CTGCTCAACA AGGTGTTCGC CGGTGATTAC
GACGGCGCTG TAGCCGAGAT GCGTGCGTAC CCGCCGCAAG CGATCTTCAA TGCCGAGTCG
CCACTGATCG TCGGTACCGT CGCCGAAGGA TGGGTCGAGA CATTAAGTGA ATATGTGCGC
ACCGAAGCTG AAAAAGCTGC CGGTGTCGCA CCCGAACGGG CTGCGATTTA CGTTATCTGG
GCCTGGGGAC GTTTTCTCGC CGATCCAACC GACCCGGCCA TCGGTACCGA TCTGGAGCGT
GCCGCCCAGT TGCAACCCGA TGACCCGTTC TTCACCGATA TCGCAGCGTG GTGGGCATCA
CGGTAA
 
Protein sequence
MIASLHRTLT LVFAIVLVGL NTGCVPVPNS AQPTTIAPTT TALPSRPPTA TPVPTAMTGL 
ANPASVYCSE QGGYLEMRTT GDGGQIGVCL FADNSLCEEW AFYRGECRPG EQYDSTTIRP
DPTGIPAPIG ELLALFRANL PANAFNDLAA QPVPTTDGSQ LWIVYSTGMR NFDLNPLVPH
TLALYTYTDG RWQERGRTTL STESFTDGPD FVGSVQQVQI APGRIWLQIE GGIGAHGGSY
HLLSFDGTEL RTEVAAFSPY PGFGHTEDLD GDGVREVVLN RSEPYIFCYA CGVYYPAYQV
YRWQDERMVA LQISDLTDGQ TEPFADLNRQ AITSAQADLW ADALAAINAA VAQAGTADPT
TQAGTLRWNQ RLIQMTHTAH MNAIAESAYP LLNKVFAGDY DGAVAEMRAY PPQAIFNAES
PLIVGTVAEG WVETLSEYVR TEAEKAAGVA PERAAIYVIW AWGRFLADPT DPAIGTDLER
AAQLQPDDPF FTDIAAWWAS R
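
Both sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The Python sketch below shows one way to do that, then compute GC content and translate the gene sequence. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares with table 1 for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same amino-acid assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    print(translate("ATGATAGCCA GCCTACATCG CACGCTTACG"))
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check on the record.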