Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3775 |
Symbol | |
ID | 7267849 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 4604729 |
End bp | 4606348 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643568583 |
Product | RNA binding S1 domain protein |
Protein accession | YP_002465047 |
Protein GI | 219850614 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0539] Ribosomal protein S1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.704215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATC TCGATGATTA CGAAGGCGAT CCGGCCCTCA ACCGTGAGCG CCTGAGTGAA CTGTTAACCG ATCAACTCGA AGAGCTGGCC CGCGCGATTC ATTCGCGCGA TCAACTGGTG CGAGCACGCG CGGCCAGCCG ACTGGTCAAT CTTGAGGTCG ATCCTGATCT GGTGTTACCG ACCCTGCACC ATCATTGGCC GGCCGTGCGT GAGGTTGCCA TCGAAGCCAT TGGGTATACC GGTAAACCGC TGAGTCCTGC GGTCATCGAT GCCTTATTAG CAAGTATCGA TGACCCGAAA CCGTTTGTAG CCGCCGCGGC AATCCGCACG TTAGGGCGAA AACAGATCGC AGAGGCACGT GAACAAATCA CAGCTTGTCT CGATGATCCA GATCCTCCCA TCGTTGCTGC CGCCATCGCC GCACTTGCCC GCCTTGGTGA TACCACGCTC GCTGTAGCCA TTCCCAACTT TCTCAACAGC CCACACCTTG CCATCCGTAT CGCGGCTGCC GAGGCGGCGG GAATACTGCA TACTCCGGCA GCCGTCCCCG GGCTGTTACG CTTGCTCGAA GATTGCATAA CGGCGTGGCA GGAAACTCAG CCCCATATTC CCAGCCGAGC GGCAAGTGTC GCAATGCAGG CATTAGCGCG CTTACGTGCC CGCACGGCGA TACCACTCCT CGTTGAAATC GCCCGCTATG TCGTCGGTCT ACGAACATTA GCAGTGCGTA CTCTCAACCA ACTACAAGCC GTAGAAGCAG CTCCGGCGAT CGCATCACTC CTTCACGAAG AAGGCGGTCA TCTCTTACAC GAAGTCATTC GTTTGGTGAA GATGGCCGAT TACCGGGCTG CACTACCTGA ATTACGCGCT CTTCTCCAAC GCTCTGCCCC TAACCGACGA TCATTGATGA TCAAGATTAT GCAGATTCTG GTCGAATGGA ATGACCGGGC GAGTATGCCA TTACTTGCCC AACTGGCCGA GAGCTTTCCC AACGCCGAGA TTCGTCATCA TGCGGCCCGC TGCCTCACCA TCCTAGAGCA AGCGACCACT ACACCCGAAG AACCATCGCC ACCGTTACCA GATCCTGCAC CAACAGTATT ATGTAGTGAA CGTCTCCGCA AACGGCAAGA GCGGATCGCC TCCGTCAGCG TTGGGAGCAT CGTTGAGGGA ACGGTGTTGC GCGTATTGAG TTATGGAGCA GTGATCGATC TCGGTGGGAT AGAAGGGTTT GTTCACGTGC GCGACATCGA CTGGCATTGG ATCAGCGACG CACGCAACGC GCTGCAACTC GGCCAACCGG TCCGTGCGAT GATTACCAAC ATTGACCGGC AGCATCTGCG TATCAATCTG AGCATCCGCG AACTTACCCC TGATCCGTGG GTAAGTCTCT CGCAACACCT TGCCGGCGGC ATGACGGTGC AAGGAACTGT TACCGGTATC ACCGGTTTTG GTCTGTTTGT CGAACTCTTA CCCGGCATCC AAGGCCTCGC CCATATCAGC AAAATTCCGG CGAAGCGCCG ACCATTACGT GAATGGTTCC CACTCGGTAG TCAGGTGATG GTCACGATCC TCGCGATCGA TAACGAGCAC CGACGCATTG CGCTGAGTGT TAATGAATGA
|
Protein sequence | MNDLDDYEGD PALNRERLSE LLTDQLEELA RAIHSRDQLV RARAASRLVN LEVDPDLVLP TLHHHWPAVR EVAIEAIGYT GKPLSPAVID ALLASIDDPK PFVAAAAIRT LGRKQIAEAR EQITACLDDP DPPIVAAAIA ALARLGDTTL AVAIPNFLNS PHLAIRIAAA EAAGILHTPA AVPGLLRLLE DCITAWQETQ PHIPSRAASV AMQALARLRA RTAIPLLVEI ARYVVGLRTL AVRTLNQLQA VEAAPAIASL LHEEGGHLLH EVIRLVKMAD YRAALPELRA LLQRSAPNRR SLMIKIMQIL VEWNDRASMP LLAQLAESFP NAEIRHHAAR CLTILEQATT TPEEPSPPLP DPAPTVLCSE RLRKRQERIA SVSVGSIVEG TVLRVLSYGA VIDLGGIEGF VHVRDIDWHW ISDARNALQL GQPVRAMITN IDRQHLRINL SIRELTPDPW VSLSQHLAGG MTVQGTVTGI TGFGLFVELL PGIQGLAHIS KIPAKRRPLR EWFPLGSQVM VTILAIDNEH RRIALSVNE
|
| |