Gene Cagg_0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0114 
Symbol 
ID7266852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp158764 
End bp160206 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content56% 
IMG OID643564986 
Producthypothetical protein 
Protein accessionYP_002461502 
Protein GI219847069 
COG category[S] Function unknown 
COG ID[COG1543] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.54756 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.147271 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATAG CATTTGTGAT CGAGGTTCAT ACACCGTATG TGCGCCATCC CGGACGGCAT 
CCGATTGGTG AGGAGCGCTT ACATACCGTC ATTGCGCAGT TGTTGATACC GTTACTCGAT
CTGCTCGGCG AATTGCAACG TCACCACCTC CCGATCACAA TCACCCTCGC CTGTTCGCCA
ATTGCATTAG AACAGTGGCT TGATCCGATT GTGAGTAAGC ATTTCGGGCA ATGGTTAGAG
GATCGTGTTA CGTATCATCA AGCTGAGTTG AACCGTTTTG AAGCGGAAGG CAACCGTCAT
GGAGCCTACT TAGCCCGGTT TTACCTCGAT TGGGATCGTC AACTCCTACG CACGTTCACG
ACTCGATACC GCCGGAATCT GGTAGGGCGG CTGCAAGAAC TCACCCTTGC CGAAATCGTC
GTCCCGCTCT GTTTGCCGGC CAGCCATGCG ATTCTGCCTC TTTTGAGTCG TGAGAGTATG
GTGCGCGCAC AACTCGAACA CGGTTTATTG TACATTTCGC GCCATCTAAA ACGACCGGAA
GGCCTGTGGT TACCCAACCG TGCCTGGCGA CCGGGGATCG AGCAAATTGC CCTCGAACTC
GATCTGCGTT ACGTGTTGGT TGAGCCGACG AGTGTCGCAT CGGGATCATT ACCCGGCTGG
ATCGTGCCGC GCCGGCTGGC GGCAATTGGG ATTGACGATG CGCTGGCCCA CCATGTGAAT
TCCGTCGAGC TGGGCTATCC CGGTGACCCG CTCTATCGCA ATCCCGACGA TCCTACCGGC
TATACTGCGA ACGGTACACA TACGCCACAG CCCTATGATC CATATGATGC GCTCCGTCGT
GCGCAAGAAC ATGCCAACCA TTTTGTCGAA CAACTGCTAC ACCGCATCCA ACAACTGCCT
GCGGAAGCGG TTGTTGTAGT CCCGATTGAT ACACGCTTGT GGGGAAGCAG ATGGTTTGAG
GCACCAACTT GGTTCCAGGC CGTCTTGACC CGTTGTGCCA CAGATACCCG CTTACGCCTA
ACGCATCCAG GAACGGCGTT GACCGATTTG CGGGTAAGTG ACGTGGTGAC GTTACGCCCC
GATCTCGCGA TCTCCAATAA GCACGGACGC CGGCAGAATA TCGTTTCACA GCGGTATTGG
CAAGCGCTGG CCGATGCCGA ACAGCGCTTC GCCGATTTGG TCGCTACCTA CCCATCTGCG
GAGGGTCTCC GCGAGCGTGT ACTGACGCAG GCTGCCCGCG AGCTTTTCCT TGCCGAACAA
AGTGATTGGA TCGATGCACC ACACGAGTTG GGCTGGCAAC GCCACCTTGA TCGCTTTGAG
CAGTTGTTGA TCCTTGCGCG CCAAGAGTCG CTCAGCGCGA CCGATCTCTT CACCCTCGAA
CAAATCGAGA CCTACGATGC GATCTTTCCG GTGCTTAATT ATCGTTTGTT TGGACGAGGG
TGA
 
Protein sequence
MPIAFVIEVH TPYVRHPGRH PIGEERLHTV IAQLLIPLLD LLGELQRHHL PITITLACSP 
IALEQWLDPI VSKHFGQWLE DRVTYHQAEL NRFEAEGNRH GAYLARFYLD WDRQLLRTFT
TRYRRNLVGR LQELTLAEIV VPLCLPASHA ILPLLSRESM VRAQLEHGLL YISRHLKRPE
GLWLPNRAWR PGIEQIALEL DLRYVLVEPT SVASGSLPGW IVPRRLAAIG IDDALAHHVN
SVELGYPGDP LYRNPDDPTG YTANGTHTPQ PYDPYDALRR AQEHANHFVE QLLHRIQQLP
AEAVVVVPID TRLWGSRWFE APTWFQAVLT RCATDTRLRL THPGTALTDL RVSDVVTLRP
DLAISNKHGR RQNIVSQRYW QALADAEQRF ADLVATYPSA EGLRERVLTQ AARELFLAEQ
SDWIDAPHEL GWQRHLDRFE QLLILARQES LSATDLFTLE QIETYDAIFP VLNYRLFGRG