Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_0824 |
Symbol | |
ID | 3746823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1150406 |
End bp | 1151470 |
Gene Length | 1065 bp |
Protein Length | 354 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637773354 |
Product | DHH family protein |
Protein accession | YP_379133 |
Protein GI | 78188795 |
COG category | [R] General function prediction only |
COG ID | [COG0618] Exopolyphosphatase-related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.534288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTATTC CCTCCTACGG TCGCACCCTT CACGCTGAAG AGTGGCAACC GCTCCTTGAG CCGCTGCTTG CAGCTCAACA CCTTGTTTTA ACAACGCACG AAAATTCTGA TGGCGATGGC TTAGGGTGCG AAGTTGCCCT TGCTCTTGCT CTTACGGCTC TTGGCAAAGA GGTTTCCATT GTGAACCCAA CGGAAGTACC GCCCAACTAC CAATTTTTGA GGCAACTCTA CCCAATAGTT CAATTTAATC CCAAAAGTGA AGAGGCAATT CAAGAGCTTT CGCTGTGCGA TGCCGTGGTG CTGCTTGATG CCAATTTAAG CGACCGCATG GGAACCTTGT GGCCTCACGT TCGTTTTGCA CGCGAGCTTG GTAGTTTAAA GCTTCTCTGC GTTGATCACC ATCTTGAACC AAATGATTTT ACCGATGTTA TGATTTCGGA GTCGTATGCC TCCTCCACTG GCGAGTTAGT ATATGGCTTA ATTCTTGCTA TGGAACAAAG TGTTGGGCGT GCGCTCTTTA CACCCAATAT TGCTCAAGCG CTCTATGTGG CGGTAATGAC GGATACGGGT TCATTCCGAT TTTCAAAAAC AACTCCATAC GTTTATCAAT TAGCGGGCGA TTTAGTGGCG CGTGGGGCTA ATCCCGAAAA AGCATACGAT TTAATTTTTA ATTCGCTAAC GCCTCAAGCG CTCAAATTAC TTGGCTTGTC GTTAAGCGCT ATTTCTCTTG TTGAGGGGGG AAAACTTTCG TGGCTGCTTA TTTCACAAGA GATGTTAAAA GCAACGGAAA GTAAGTTGTT TGATACTGAT ATTATTGTCC GTTATCTTTT AAGTGTGCCC TCAGTTGCCA TAGCGGTACT TTTAGTTGAA ATGCAAGATG GACGTACCAA AGCAAGTTTT CGCTCGCGTG GCAAGTTGCC CGTTAATAAA CTTGCTAAAG AATTTGGCGG CGGTGGGCAT ATGAATGCGG CTGGTGCGCT TTTTCCCTAT ACGCCCGAAA AGGTACAACA AGTGCTTCCG CAAGCTGTGC GTCGCTTTAT AAAAGAGCAT GAAGCGCTGC TGTAA
|
Protein sequence | MIIPSYGRTL HAEEWQPLLE PLLAAQHLVL TTHENSDGDG LGCEVALALA LTALGKEVSI VNPTEVPPNY QFLRQLYPIV QFNPKSEEAI QELSLCDAVV LLDANLSDRM GTLWPHVRFA RELGSLKLLC VDHHLEPNDF TDVMISESYA SSTGELVYGL ILAMEQSVGR ALFTPNIAQA LYVAVMTDTG SFRFSKTTPY VYQLAGDLVA RGANPEKAYD LIFNSLTPQA LKLLGLSLSA ISLVEGGKLS WLLISQEMLK ATESKLFDTD IIVRYLLSVP SVAIAVLLVE MQDGRTKASF RSRGKLPVNK LAKEFGGGGH MNAAGALFPY TPEKVQQVLP QAVRRFIKEH EALL
|
| |