Gene Cagg_3775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3775 
Symbol 
ID7267849 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4604729 
End bp4606348 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content56% 
IMG OID643568583 
ProductRNA binding S1 domain protein 
Protein accessionYP_002465047 
Protein GI219850614 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.704215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGATC TCGATGATTA CGAAGGCGAT CCGGCCCTCA ACCGTGAGCG CCTGAGTGAA 
CTGTTAACCG ATCAACTCGA AGAGCTGGCC CGCGCGATTC ATTCGCGCGA TCAACTGGTG
CGAGCACGCG CGGCCAGCCG ACTGGTCAAT CTTGAGGTCG ATCCTGATCT GGTGTTACCG
ACCCTGCACC ATCATTGGCC GGCCGTGCGT GAGGTTGCCA TCGAAGCCAT TGGGTATACC
GGTAAACCGC TGAGTCCTGC GGTCATCGAT GCCTTATTAG CAAGTATCGA TGACCCGAAA
CCGTTTGTAG CCGCCGCGGC AATCCGCACG TTAGGGCGAA AACAGATCGC AGAGGCACGT
GAACAAATCA CAGCTTGTCT CGATGATCCA GATCCTCCCA TCGTTGCTGC CGCCATCGCC
GCACTTGCCC GCCTTGGTGA TACCACGCTC GCTGTAGCCA TTCCCAACTT TCTCAACAGC
CCACACCTTG CCATCCGTAT CGCGGCTGCC GAGGCGGCGG GAATACTGCA TACTCCGGCA
GCCGTCCCCG GGCTGTTACG CTTGCTCGAA GATTGCATAA CGGCGTGGCA GGAAACTCAG
CCCCATATTC CCAGCCGAGC GGCAAGTGTC GCAATGCAGG CATTAGCGCG CTTACGTGCC
CGCACGGCGA TACCACTCCT CGTTGAAATC GCCCGCTATG TCGTCGGTCT ACGAACATTA
GCAGTGCGTA CTCTCAACCA ACTACAAGCC GTAGAAGCAG CTCCGGCGAT CGCATCACTC
CTTCACGAAG AAGGCGGTCA TCTCTTACAC GAAGTCATTC GTTTGGTGAA GATGGCCGAT
TACCGGGCTG CACTACCTGA ATTACGCGCT CTTCTCCAAC GCTCTGCCCC TAACCGACGA
TCATTGATGA TCAAGATTAT GCAGATTCTG GTCGAATGGA ATGACCGGGC GAGTATGCCA
TTACTTGCCC AACTGGCCGA GAGCTTTCCC AACGCCGAGA TTCGTCATCA TGCGGCCCGC
TGCCTCACCA TCCTAGAGCA AGCGACCACT ACACCCGAAG AACCATCGCC ACCGTTACCA
GATCCTGCAC CAACAGTATT ATGTAGTGAA CGTCTCCGCA AACGGCAAGA GCGGATCGCC
TCCGTCAGCG TTGGGAGCAT CGTTGAGGGA ACGGTGTTGC GCGTATTGAG TTATGGAGCA
GTGATCGATC TCGGTGGGAT AGAAGGGTTT GTTCACGTGC GCGACATCGA CTGGCATTGG
ATCAGCGACG CACGCAACGC GCTGCAACTC GGCCAACCGG TCCGTGCGAT GATTACCAAC
ATTGACCGGC AGCATCTGCG TATCAATCTG AGCATCCGCG AACTTACCCC TGATCCGTGG
GTAAGTCTCT CGCAACACCT TGCCGGCGGC ATGACGGTGC AAGGAACTGT TACCGGTATC
ACCGGTTTTG GTCTGTTTGT CGAACTCTTA CCCGGCATCC AAGGCCTCGC CCATATCAGC
AAAATTCCGG CGAAGCGCCG ACCATTACGT GAATGGTTCC CACTCGGTAG TCAGGTGATG
GTCACGATCC TCGCGATCGA TAACGAGCAC CGACGCATTG CGCTGAGTGT TAATGAATGA
 
Protein sequence
MNDLDDYEGD PALNRERLSE LLTDQLEELA RAIHSRDQLV RARAASRLVN LEVDPDLVLP 
TLHHHWPAVR EVAIEAIGYT GKPLSPAVID ALLASIDDPK PFVAAAAIRT LGRKQIAEAR
EQITACLDDP DPPIVAAAIA ALARLGDTTL AVAIPNFLNS PHLAIRIAAA EAAGILHTPA
AVPGLLRLLE DCITAWQETQ PHIPSRAASV AMQALARLRA RTAIPLLVEI ARYVVGLRTL
AVRTLNQLQA VEAAPAIASL LHEEGGHLLH EVIRLVKMAD YRAALPELRA LLQRSAPNRR
SLMIKIMQIL VEWNDRASMP LLAQLAESFP NAEIRHHAAR CLTILEQATT TPEEPSPPLP
DPAPTVLCSE RLRKRQERIA SVSVGSIVEG TVLRVLSYGA VIDLGGIEGF VHVRDIDWHW
ISDARNALQL GQPVRAMITN IDRQHLRINL SIRELTPDPW VSLSQHLAGG MTVQGTVTGI
TGFGLFVELL PGIQGLAHIS KIPAKRRPLR EWFPLGSQVM VTILAIDNEH RRIALSVNE