Gene Cagg_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3044 
Symbol 
ID7267259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3702340 
End bp3703449 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content56% 
IMG OID643567864 
ProductRieske (2Fe-2S) domain protein 
Protein accessionYP_002464338 
Protein GI219849905 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000332714 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGACCT TTGTCCGCGC ATCACTTGAT CGTCCGGCCT ACACCTTGCC CGGCTATTAT 
TTTAGCGCAA CCGACATCTT TATACGTGAA CAAGACCGTA TCTTTGCGCG CACGTGGGTC
TGCGTCGGGC GGAGTGAAGA TGTCGCTACT GCCGGTGCGT ACTGTCTGAT CGAGGTCGCT
GGCGAAAGCC TTATTCTCCT ACGCGACCAG ACCGGTCAAC TCCATGCACA TTATAACGTC
TGTCGTCATC GCGGAGCACG TCTCTGCACA GAGTTGCAAG GTCAGTTAAG CGAAACGATT
GTCTGTCCGT ACCATGCATG GACGTATCGG CTCGATGGGA CGCTGGCGAC CGCGCGCTAT
ATGCAGGATG CGCCGGGGTT TCGCTGTGAA GACTGGCCGT TACTGAGTGC TGCCGTCGCC
GAATGGGATG GGTTTGTGTT CGTATCACTG GCCGAACAGC CCGTTGCATT CGAGCGCGCG
TTTGCGCCGC TCATCGGAAA GTTTCAGGCA TGGGACCCGG GGCGATTGCG CTGCGGCGCC
CAGATTGTAT ACGAAGTGGC GGCCAACTGG AAGCTGATTA TCGCTAACTA TTCGGAGTGT
TACCATTGCC CGCTTATTCA CCCCGAACTC GTAGCCGTTT CTCCGTGGCA AAGCGGGCGC
AACGATCTGA CGAGTGGCCC GTTTCTCGGT GGATATATGG ATCTGAAACA CGAGAGCATG
ACGCTAGACG GTCATACTCG TCGCTCGCCA TTGCCCGGTC TAAACGCCGA AGATCGGCGG
CGCGTTTATT ACTACGCTAT CTTTCCCAAT CTGCTCCTCA GCCTCCATCC CGATTACGTG
ATGGCGCATC GTCTCATCCC GCGACGCCCT GATGCAACGA CGATTGTCTG TTCCTGGTAC
TTTGCGCCGG AAGTGATGGC TCTACCCGAT TTCGATCCTT CTGATGCCGT TGAGTTTTGG
GATCACACCA ACCGCCAAGA CTGGCGCGTC TGTGAGTTAT CGCAGCAAGG AGTCAGTTCA
CGTGCGTATC GCCCCGGTCC GTATGCTCAA TCAGAAGGAT TGTTGTGGCA GTTCGATCAG
GAATATCTAC GGGTCATGGG TGAGGAATAA
 
Protein sequence
MTTFVRASLD RPAYTLPGYY FSATDIFIRE QDRIFARTWV CVGRSEDVAT AGAYCLIEVA 
GESLILLRDQ TGQLHAHYNV CRHRGARLCT ELQGQLSETI VCPYHAWTYR LDGTLATARY
MQDAPGFRCE DWPLLSAAVA EWDGFVFVSL AEQPVAFERA FAPLIGKFQA WDPGRLRCGA
QIVYEVAANW KLIIANYSEC YHCPLIHPEL VAVSPWQSGR NDLTSGPFLG GYMDLKHESM
TLDGHTRRSP LPGLNAEDRR RVYYYAIFPN LLLSLHPDYV MAHRLIPRRP DATTIVCSWY
FAPEVMALPD FDPSDAVEFW DHTNRQDWRV CELSQQGVSS RAYRPGPYAQ SEGLLWQFDQ
EYLRVMGEE