Gene Cagg_3611 details

Gene Information

Locus tag: Cagg_3611
Symbol:
ID: 7269755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 4386649
End bp: 4387839
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 57%
IMG OID: 643568418
Product: 2-alkenal reductase
Protein accession: YP_002464884
Protein GI: 219850451
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
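
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive and that the last codon is a stop codon that is not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4386649
end_bp = 4387839

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1191
print(protein_length)  # 396
```

Both results agree with the Gene Length and Protein Length fields in the record.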


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.249525
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.817359
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGAAC GAACCGGTCG CGGTCGGGGC ATTGTGTTGT TCATCGAACT GTTTTTGCTG 
CTGGTCATTG CCGGTGCGTT CGTATGGACA GCTTTGGTGA ATGATCGTCG GGAAACCGTG
GCGGTTACGC CATCGCCAAC ACCGGTAACG TTAACCATGC CTACGATCAT TCCAATGACC
GCGACCGATC TGGAAGCGCA GATCGCCGCC GTCTATCGTG AGGCTGGGCC GAGTGTGGTA
AATATCACCA GTCGCTCGAT CAGCTACGAC TTTTTCTTTA ATCCGGTGCC CCGCCAGGGT
AGTGGCTCGG GCTTTTTCTA CGATACCGCA GGCCATATTG TCACCAACTA TCACGTGGTG
GCCGATGCCG ACGAATTGCA AGTAACGCTC GCCGATGGGC GTACTGTGTC GGCCAAGATT
GTTGGCAGTG ACCCATCGAA CGATCTGGCC GTGATTAAGG TTGATTTACC GGCCGATGAA
ATTCGGCCAC TGCCGATCGG TGATTCGACG CAGGTCTACG TCGGCCAATT TGTGCTCGCC
ATCGGTAATC CGTTTGGTCT TGAACGTACC CTCACTTTTG GGATTATCAG TGCGCTTGGA
CGGGTCATCG AGAGTCCGAA CCAGCGCTTC ATCGGCGAGG TGATTCAGTC TGATGTGGCG
ATCAATCCCG GCAACTCTGG CGGGCCGCTG CTCGATTTGT CAGGACGGGT GATTGGGGTC
AATTCAGCCA TTCTCAGTCC CAGTGGGGCC AATGCCGGCA TCGGTTTTGC TATCTCGGCG
CGAACGGTGC AGCGCGTAGT ACCGGTGTTG ATCCGGGAAG GGCGCTATCC GCATCCATCG
TTGGGAGTGC GCCTAATCGA ACTGACGCCG CAACGTGCTG CTCTGTTTGA GCGGGCCGGT
ATGAATTTGC CGACCAAGCA GGGTCTGCTT ATTGCCGAGC TGATCGAGGG TGGCCCGGCG
GCTCGAGCCG GTTTGCGAGG CCCGCAGCAG GTCGTGCGGG TCGGAAACTG GATTTTGCCG
GTTGGTGGCG ATATTATCGT GGCAATCAAT GGCCGCTCGA TTACGAGCAG CCAAGAGCTG
TTGGTCTATC TCGAAACCGA AACCCAAGTG GGTGAAACGG TGCAAGTAAC CGTTATCCGG
GATGGTCGCG AGCGGATTAT CCCGGTGACG TTAGCCGAAT TGTCGTCGTA A
 
Protein sequence
MSERTGRGRG IVLFIELFLL LVIAGAFVWT ALVNDRRETV AVTPSPTPVT LTMPTIIPMT 
ATDLEAQIAA VYREAGPSVV NITSRSISYD FFFNPVPRQG SGSGFFYDTA GHIVTNYHVV
ADADELQVTL ADGRTVSAKI VGSDPSNDLA VIKVDLPADE IRPLPIGDST QVYVGQFVLA
IGNPFGLERT LTFGIISALG RVIESPNQRF IGEVIQSDVA INPGNSGGPL LDLSGRVIGV
NSAILSPSGA NAGIGFAISA RTVQRVVPVL IREGRYPHPS LGVRLIELTP QRAALFERAG
MNLPTKQGLL IAELIEGGPA ARAGLRGPQQ VVRVGNWILP VGGDIIVAIN GRSITSSQEL
LVYLETETQV GETVQVTVIR DGRERIIPVT LAELSS
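
The sequences above are printed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that layout, compute GC content, and translate the gene sequence. It is an illustration, not the pipeline that produced this record. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, which is assumed to be adequate here because translation table 11 uses the same codon-to-amino-acid assignments and differs in its permitted start codons; alternative start codons are not modeled, which does not matter for this gene since it begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: sufficient for translation table 11, ignoring alternative starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGTCGGAAC GAACCGGTCG CGGTCGGGGC"
print(translate(first_blocks))  # MSERTGRGRG
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.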