Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2534 |
Symbol | |
ID | 4569724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2905987 |
End bp | 2907069 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 639767099 |
Product | protein of unknown function DUF900, hydrolase family protein |
Protein accession | YP_912946 |
Protein GI | 119358302 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0983031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTTC AATGGCGTTT TGTCTCTTTT CTGGTTTTGG TTTTTCTTTT TGCCGGATGT ACGGCTACCA GCTCTCTTGT TCAACGTCAA ACCATCAGGG TGTTTTATGC GACCGATCGC GCCCTTAATG GGCAGTCCGA TCATTCTGAG CTGTATGGCG GGGAACGGGG CGCGGTGACC TATGGGGTTT GCGAGGTGGG TATTTCGCAG GGGCACGGTA TTGCTGAACC TGATATGAGA TTGTACGGGG AGCCCGACCG GAAAAATCCT GATTCGGATG CTGAGTTGCA GGCTGTCAAT CTTATATGCG AAATGGATTT TTTTTCTGAA CTTGATCGCA GCGTCAAGGG GTCGCCGTCA GGTGATTTGC TGCTTTTTGT TCATGGTTAC AACCTGACTT TTGAAAAAGC AGCTCTGAAT ACAGCACTGC TTTTCTGCGA TCTTGGCTTC AATGGAGCTC CGTTGTTCTA TAGTTGGCCG TCACGTGGAT CGATCAGCAA GTATGCTATC GATGAAACTA ATATCGAATG GTCTCAGCCT GACCTGAAAA GGTTTCTTGA AGCTGTTGCC AGAAGGTCTG GAGCGAAGGA TATTTATTTG ATGGCCCACA GTCTGGGAAA TCGAGCTATG ACTAAAGCGT TGATCGAACT GCTTCAGGAG CAGCCTCAAC TGAAAAACCG GTTCAAGGCG CTGATTCTTA TGGCTCCCGA CATTGATGCG GAAATTTTTA AACGAGATAT CGCCCCGAAA CTTACCGATA CAGGAGCCTT TGTAACGCTC TATGCTTCAT GCGATGATAA GGCGCTGCAA CTTTCAACAG ACGTGCATGG GTATATGAGG GCAGGGGATA TCTCGCAACG TTCGCTTCTT GTGCCGGGTA TCGAAATTAT CGATGTAACC AGTGTCGATA CCGGTTTTTT TGGACATTCA TACTACAAGG GTTCGCGACT GGTTCTCAGG GATCTCGCTT TTCTTATAAA CAGGGGATTT CATGCCAATG ATCGCTTGTC TCTTGAGCCG ATAGATTTGC CCCAGGGTCG TTTTTGGAGG ATAAAAAAAG ATGTAAAGCA TGCTTTGCCT TAA
|
Protein sequence | MRFQWRFVSF LVLVFLFAGC TATSSLVQRQ TIRVFYATDR ALNGQSDHSE LYGGERGAVT YGVCEVGISQ GHGIAEPDMR LYGEPDRKNP DSDAELQAVN LICEMDFFSE LDRSVKGSPS GDLLLFVHGY NLTFEKAALN TALLFCDLGF NGAPLFYSWP SRGSISKYAI DETNIEWSQP DLKRFLEAVA RRSGAKDIYL MAHSLGNRAM TKALIELLQE QPQLKNRFKA LILMAPDIDA EIFKRDIAPK LTDTGAFVTL YASCDDKALQ LSTDVHGYMR AGDISQRSLL VPGIEIIDVT SVDTGFFGHS YYKGSRLVLR DLAFLINRGF HANDRLSLEP IDLPQGRFWR IKKDVKHALP
|
| |