Gene Cagg_3659 details


Gene Information

Locus tag: Cagg_3659
Symbol:
ID: 7268194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4448826
End bp: 4450001
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 55%
IMG OID: 643568465
Product: hypothetical protein
Protein accession: YP_002464931
Protein GI: 219850498
COG category: [S] Function unknown
COG ID: [COG5282] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03624] putative hydrolase
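
The Start bp, End bp, Gene Length, and Protein Length fields are consistent with each other. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
START_BP = 4448826
END_BP = 4450001


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1176 391
```

Both results match the Gene Length (1176 bp) and Protein Length (391 aa) reported in the record.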


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000101981
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGTCGGTA ATCAGAATGA ATTACGTCGC TTCGGCGCCG CTCTGTTGAT CGGAATAGCG 
GCCGGTTTAG CGGCGCGCTA CTATTTCGAC TCTCGTGCTC GTAATGAAAG TCGGGTGCCC
ACCGGCTTGA TCGACTGGGA GCAGGCCCGT CAGGCGGCAT TACGTTTGTC GCAGTGGGAG
CAAGCCCCGG TTGACGATCG CCATTTTCGC CGCCAGCAGT ACGCCCAGAT GGTGGCGCAA
AGTGAACCTC TCATCGCCGA GTATCTTGGC GTGCAATTAC CCGAACCGGT CAATCGAATT
TTTGTCTTCG ACCGGCGGGA ATGGCTCGAA GCGAATATTG TCTCATTTAG CCAGCTCTTC
CGCCCCCTCG AAGAGGTGTA TGAAAAGAAT GGTGGCGGGC GTGGTGCATT GGGGGTGATG
GTTAACGACG TCAGCAGTAA GTTGCTGGGG ATGCAGATCG GTGGTCTCCT TGGGTATCTG
GCTCAGCGTG TGCTCGGTCA GTACGACTTA AGTCTGCTCT CGGCCGAAGC GACCGGTGGT
TCGCTGTACT TTGTCGAACC GAATATTGCC CGTGTCCAGC AGCAACTCGG CCTGAACGAT
ACCGATTTTC GGCTCTGGAT TACGCTGCAC GAGATGACCC ACGCCTTTGA GTTTGAAGCG
TATCCATGGG TGCGTCGCTA TTTCCGTGAA CTGATCGAGC AGAACTTTAC GCTCATCAGC
GGCCAAATGC TGGGTAACGG CAATAATCTG ATCGATATTA TGATGCGGCT GGTGCAAGGG
GTCGGGAGTG GTCAACATTG GATCGAATCG GTATTGACAC CCGATCAGCG GGTGGTGTTT
GATCGGATTC AAGCACTGAT GTCAATTATT GAAGGTTACG GCAACCATGT GATGAACGCG
GTTGGTCGGC GCTTGCTACC GAGTTTCAGC CAGATCGAAC ATCAGATCGC GCAGCGGCAG
CGGCAGCGAA CGTTACTCGA TCAGATGGTC TTTCGCTTAA CCGGCCTCGA TCTCAAACTA
GCCCAATATC AGCAAGGTGA GGCATTTGTC AATGCGGTAG TGGCCGCACG CGGGATCAGA
TTTGCCGGTC GCGTCTGGGA ACGGCCCGAA CATCTGCCGT CAATGGAAGA GATCCGCAAT
CCGGCGATGT GGATTGCCCG CATAGAACGT ATGTAG
 
Protein sequence
MVGNQNELRR FGAALLIGIA AGLAARYYFD SRARNESRVP TGLIDWEQAR QAALRLSQWE 
QAPVDDRHFR RQQYAQMVAQ SEPLIAEYLG VQLPEPVNRI FVFDRREWLE ANIVSFSQLF
RPLEEVYEKN GGGRGALGVM VNDVSSKLLG MQIGGLLGYL AQRVLGQYDL SLLSAEATGG
SLYFVEPNIA RVQQQLGLND TDFRLWITLH EMTHAFEFEA YPWVRRYFRE LIEQNFTLIS
GQMLGNGNNL IDIMMRLVQG VGSGQHWIES VLTPDQRVVF DRIQALMSII EGYGNHVMNA
VGRRLLPSFS QIEHQIAQRQ RQRTLLDQMV FRLTGLDLKL AQYQQGEAFV NAVVAARGIR
FAGRVWERPE HLPSMEEIRN PAMWIARIER M
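
The gene sequence begins with TTG while the protein sequence begins with M. This is expected under translation table 11 (bacterial, archaeal and plant plastid code), where TTG is one of the alternative initiation codons and an initiating codon is translated as methionine. The sketch below illustrates this; it is a hand-rolled translator, not the tool the database used, and the names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 shares these
# assignments with the standard code and differs only in its start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating codon is read as methionine regardless of its usual meaning.
        if index == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
print(translate_cds("TTGGTCGGTA ATCAGAATGA ATTACGTCGC"))  # MVGNQNELRR
# Last two codons of the gene sequence above.
print(translate_cds("ATGTAG"))  # M
```

The two outputs match the first ten residues and the final residue of the protein sequence listed above.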