Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0114 |
Symbol | |
ID | 7266852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 158764 |
End bp | 160206 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643564986 |
Product | hypothetical protein |
Protein accession | YP_002461502 |
Protein GI | 219847069 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.54756 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.147271 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAATAG CATTTGTGAT CGAGGTTCAT ACACCGTATG TGCGCCATCC CGGACGGCAT CCGATTGGTG AGGAGCGCTT ACATACCGTC ATTGCGCAGT TGTTGATACC GTTACTCGAT CTGCTCGGCG AATTGCAACG TCACCACCTC CCGATCACAA TCACCCTCGC CTGTTCGCCA ATTGCATTAG AACAGTGGCT TGATCCGATT GTGAGTAAGC ATTTCGGGCA ATGGTTAGAG GATCGTGTTA CGTATCATCA AGCTGAGTTG AACCGTTTTG AAGCGGAAGG CAACCGTCAT GGAGCCTACT TAGCCCGGTT TTACCTCGAT TGGGATCGTC AACTCCTACG CACGTTCACG ACTCGATACC GCCGGAATCT GGTAGGGCGG CTGCAAGAAC TCACCCTTGC CGAAATCGTC GTCCCGCTCT GTTTGCCGGC CAGCCATGCG ATTCTGCCTC TTTTGAGTCG TGAGAGTATG GTGCGCGCAC AACTCGAACA CGGTTTATTG TACATTTCGC GCCATCTAAA ACGACCGGAA GGCCTGTGGT TACCCAACCG TGCCTGGCGA CCGGGGATCG AGCAAATTGC CCTCGAACTC GATCTGCGTT ACGTGTTGGT TGAGCCGACG AGTGTCGCAT CGGGATCATT ACCCGGCTGG ATCGTGCCGC GCCGGCTGGC GGCAATTGGG ATTGACGATG CGCTGGCCCA CCATGTGAAT TCCGTCGAGC TGGGCTATCC CGGTGACCCG CTCTATCGCA ATCCCGACGA TCCTACCGGC TATACTGCGA ACGGTACACA TACGCCACAG CCCTATGATC CATATGATGC GCTCCGTCGT GCGCAAGAAC ATGCCAACCA TTTTGTCGAA CAACTGCTAC ACCGCATCCA ACAACTGCCT GCGGAAGCGG TTGTTGTAGT CCCGATTGAT ACACGCTTGT GGGGAAGCAG ATGGTTTGAG GCACCAACTT GGTTCCAGGC CGTCTTGACC CGTTGTGCCA CAGATACCCG CTTACGCCTA ACGCATCCAG GAACGGCGTT GACCGATTTG CGGGTAAGTG ACGTGGTGAC GTTACGCCCC GATCTCGCGA TCTCCAATAA GCACGGACGC CGGCAGAATA TCGTTTCACA GCGGTATTGG CAAGCGCTGG CCGATGCCGA ACAGCGCTTC GCCGATTTGG TCGCTACCTA CCCATCTGCG GAGGGTCTCC GCGAGCGTGT ACTGACGCAG GCTGCCCGCG AGCTTTTCCT TGCCGAACAA AGTGATTGGA TCGATGCACC ACACGAGTTG GGCTGGCAAC GCCACCTTGA TCGCTTTGAG CAGTTGTTGA TCCTTGCGCG CCAAGAGTCG CTCAGCGCGA CCGATCTCTT CACCCTCGAA CAAATCGAGA CCTACGATGC GATCTTTCCG GTGCTTAATT ATCGTTTGTT TGGACGAGGG TGA
|
Protein sequence | MPIAFVIEVH TPYVRHPGRH PIGEERLHTV IAQLLIPLLD LLGELQRHHL PITITLACSP IALEQWLDPI VSKHFGQWLE DRVTYHQAEL NRFEAEGNRH GAYLARFYLD WDRQLLRTFT TRYRRNLVGR LQELTLAEIV VPLCLPASHA ILPLLSRESM VRAQLEHGLL YISRHLKRPE GLWLPNRAWR PGIEQIALEL DLRYVLVEPT SVASGSLPGW IVPRRLAAIG IDDALAHHVN SVELGYPGDP LYRNPDDPTG YTANGTHTPQ PYDPYDALRR AQEHANHFVE QLLHRIQQLP AEAVVVVPID TRLWGSRWFE APTWFQAVLT RCATDTRLRL THPGTALTDL RVSDVVTLRP DLAISNKHGR RQNIVSQRYW QALADAEQRF ADLVATYPSA EGLRERVLTQ AARELFLAEQ SDWIDAPHEL GWQRHLDRFE QLLILARQES LSATDLFTLE QIETYDAIFP VLNYRLFGRG
|
| |