Gene Cagg_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_2049 
Symbol 
ID7269208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2510246 
End bp2511193 
Gene Length948 bp 
Protein Length315 aa 
Translation table11 
GC content54% 
IMG OID643566884 
Productputative sulfite oxidase subunit YedY 
Protein accessionYP_002463373 
Protein GI219848940 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00190211 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTGCG ATCCTACGAT ACGTTCGTCA GAGATCACCC CCGAATCGGT CTATTTCAGT 
CGCCGCACCA TCTTGCGCGG TTTGGGTGTA ATCGGATTGA GTGCGCTGCT GAATGCCTGC
GGCGTACCGT TATCTGCCGA TACTACCGGC TCGCCGTCGG CGAGCAACAC GACCGGTCTG
CGCGATGAGC TTGGTGATCC GGCGAACAGC TTTGAGCAAA TTACGAACTA TAACAACTTC
TACGAATTTA CGACGGATAA AGAGGATGTA GCCCGCCGCG CTGCCAATTT CGTCACTCAC
CCATGGACGG TTGAGGTGAC GGGAATGGTG CGTAACCCAC AGATTTTTGC AATTGAAGAT
ATCTTAACGC AGTTCGATCA AGAAGAACGG ATCTACCGTC TGCGCTGCGT TGAAGGCTGG
TCGATGGTCA TCCCGTGGCA AGGTTTTCCG CTTCGTAAAT TGCTGGCCAT CGTTGAGCCG
ACAGCACAGG CCCAATACGT CCGTTTTGAG ACCCTTTACG ACCCCGATCA GATGCCGGGA
CAACGCGATC GCTACTTTCC CTGGCCGTAT GTCGAAGGGT TGCGTATCGA TGAAGCAATG
CACGATCTGA CGCTCCTCAG CACCGGCCTC TATGGGCGAA CCCTCTTGCC CCAAAATGGT
GCGCCGCTAC GGTTGGTTGT TCCGTGGAAG TATGGCTTCA AGAGTATCAA ATCGATTGTT
AAGATCGAAC TAACCGACCA AATGCCGGTT TCGTTGTGGA TGGCAGTCGC CCCGCACGAA
TACGGCTTCT ACGCCAACGT TAATCCCGAC GTACCACACC CGCGTTGGTC GCAAGCGACC
GAGCGGCGTA TCGGCGAGTT GGGCCGCCGA AAGACGTTAC CGTTCAACGG GTATGCGGAA
CAGGTAGCGG CACTCTACGC CGGTATGGAT TTACGCAAGA ATTATTAG
 
Protein sequence
MYCDPTIRSS EITPESVYFS RRTILRGLGV IGLSALLNAC GVPLSADTTG SPSASNTTGL 
RDELGDPANS FEQITNYNNF YEFTTDKEDV ARRAANFVTH PWTVEVTGMV RNPQIFAIED
ILTQFDQEER IYRLRCVEGW SMVIPWQGFP LRKLLAIVEP TAQAQYVRFE TLYDPDQMPG
QRDRYFPWPY VEGLRIDEAM HDLTLLSTGL YGRTLLPQNG APLRLVVPWK YGFKSIKSIV
KIELTDQMPV SLWMAVAPHE YGFYANVNPD VPHPRWSQAT ERRIGELGRR KTLPFNGYAE
QVAALYAGMD LRKNY