Gene Cagg_1437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1437 
Symbol 
ID7269269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp1768246 
End bp1769448 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content55% 
IMG OID643566280 
ProductRNA binding S1 domain protein 
Protein accessionYP_002462780 
Protein GI219848347 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.245145 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.159598 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGAGT TGAAGCATGC CGGTAATAAC ACAAACAATG AAGACGTTCG CCAACCTGAT 
CTAAACGATC TGATGGAGCG TGGCGATCAG GCGCTGATGG AACAGATACT GAGCGATCCG
GCCCATACCT ACCGTAACCT CAAACACGGT GACACCGTTG ATGGGCGGAT CATGCGGATC
GACCGAGATG AAATCTTGGT CGACATAGGC GCCAAAGCCG AAGGTGTTGT CCCTAGCCGG
GAGATGCAGA CCCTCAGTGA AGAAGATCGA GCTGCGCTCA AAGTTGGTGA TACCATTCTT
GTTTTCGTTG TCCAATCGGA AGACAAAGAA GGCCGTGCGA TTTTATCAAT CGATAAAGCT
CGGCAAGAAA AAAGCTGGCG CGCATTGCAA GAGTATTACG AGCGGGGCGA AATTATCTAT
GCCCGCGTCA AGAATTACAA CAAGGGCGGC CTGCTAGTCG ATCTCGATGG TGTGCGCGGG
TTTGTCCCTG CGTCGCAGGT GTCGAGTGTT AGCCGTGCTT CGGAGGCGCA AAAGCAATCC
GAAATGGCGC GGCTGGTAAA TGTTGAGCTG CCGCTAAAAG TGATTGAGAT CAACCGCAAC
CGCAACCGCC TGATTCTGTC CGAACGGCAG GCGTTGGTCG AAACCCGTGA GACGAAGAAA
GACGAGTTGC TCGCATCGTT ACAAGAGGGT GATGTGCGCG AAGGAGTGGT CTCGTCGGTC
TGCGATTTCG GTGTCTTCGT CGATATTGGC GGCGCCGATG GGTTGGTGCA TCTGTCCGAG
ATCTCGTGGT CGCGCGTCAA ACATCCGAGC GAAGTGCTCA AGGTGGGTGA TAAAGTCAAA
GTGTCTATCC TGAACATTGA CCACGAGCGC AAACGGATCG CGCTATCGAT CAAGCGGACC
CAAAGCGAGC CGTGGACACG GGTGGCCGAA CGCTATCAGT TGGGGCAAAT TGTCGAAGGA
ACAGTGACGC AACTGGCCTC GTTTGGCGCC TTTGTACGGA TTGAAGATGG GGTGGAAGGG
CTGATCCACG TCTCAGAAAT GGGTGATGAG CGTATTCAGC ACCCACGCGA CGTGCTAAGC
GAGGGTCAAG TTGTGCAGGC ACGGATCATC CGTATCGATC CGGCACGGAA GCGGATGGGG
TTGAGTTTAC GGCTCCAACA AGAGCCGTCG GAGGGCGGCA CCGCAACGGA GGAGGCTGGC
TAA
 
Protein sequence
MEELKHAGNN TNNEDVRQPD LNDLMERGDQ ALMEQILSDP AHTYRNLKHG DTVDGRIMRI 
DRDEILVDIG AKAEGVVPSR EMQTLSEEDR AALKVGDTIL VFVVQSEDKE GRAILSIDKA
RQEKSWRALQ EYYERGEIIY ARVKNYNKGG LLVDLDGVRG FVPASQVSSV SRASEAQKQS
EMARLVNVEL PLKVIEINRN RNRLILSERQ ALVETRETKK DELLASLQEG DVREGVVSSV
CDFGVFVDIG GADGLVHLSE ISWSRVKHPS EVLKVGDKVK VSILNIDHER KRIALSIKRT
QSEPWTRVAE RYQLGQIVEG TVTQLASFGA FVRIEDGVEG LIHVSEMGDE RIQHPRDVLS
EGQVVQARII RIDPARKRMG LSLRLQQEPS EGGTATEEAG