Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0519 |
Symbol | |
ID | 6374183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 544952 |
End bp | 546019 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642683036 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_001958963 |
Protein GI | 189499493 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATATTCC GGAAATTTAA CGGTATCTGT TTTGTCGCGC TGCTCTCTTT TTTTCTCGGT GGCTGCGCCT CAACCCTCGT GGGTTCCCTG GAAGAACTTC ACGAACCGCC TGTAGAAGCG TTTTTCGTAA CCGATCGCAA CGATACCGGT TTAAAGGATC CCGCAGAGAA ATACGGTAAA GAGCGCGCTT CGGTATCTTA CGGGATCTGC AGTGTATCCA TCCCTCCCGG TCATCGCATC GGGAAACTCG AAAGTCCCAC ATTCAGAAAG GACGTTGAGG AGCATATCGT GCTTGTGGAT GTGTCCGTTC TTGAAAAAAA AGATTTTTTT TCGAAGGTTT CTCATGCACT GAACCGTTCT GGCAAGAAGA CTATGCTTCT TTACGTGCAC GGTTATAATG TGACGTTTGA AAAGGCGGCC AGAAGAATGG TTCAGATTGT CGATGATCTT GATTTTAAGG GCATTCCGGT TTTCTACAGT TGGCCGTCTC AGGGAAGTGT CGGAGGATAT CCTGCTGATG CAGCCAGTGT CGAATGGTCG GAACAGAACC TTGGGGATTT TCTTGCGGAA GCTGCCCGGA TTTCGGGCGT AAATACCTTG TATCTTTTGG CCCACAGTAT GGGAAATCGT GCCTTGACTG GCGCTTTCCT GGATCTTGTC AGGGAAAAAC CGCATTTAAA AAGCCGTTTC AAGGCGCTGC TCCTGACCGC TCCGGATATT GATTCCGAGG TTTTCAGAAG AGATATCGGG CCGGGTCTCG CGGCCTCAGG GGCTGCGATT ACCCTTTACG CATCAGGCAG GGACAGGGCA TTGAGGCTCT CGAAAAGACT TCACGGATAT CCAAGGGCCG GGGATGTAGA CGGTTTTCCC CTGATCGTTC CCGGTATTGA GACGGTAGAC GCTACCCATG TGGATACAAG TTTTCTCGGG CACTCCTATT TCAACGGTTC GAGATCTGTA TTGTCGGATA TGTTCTATAT TCTCAATGAG GAGCTTCGGG CGGAACAACG GTTTTCACTT GAACCCGTTG ATACGCCTGA GGGGCGGTAC TGGAGATTCA AGGAGTAG
|
Protein sequence | MIFRKFNGIC FVALLSFFLG GCASTLVGSL EELHEPPVEA FFVTDRNDTG LKDPAEKYGK ERASVSYGIC SVSIPPGHRI GKLESPTFRK DVEEHIVLVD VSVLEKKDFF SKVSHALNRS GKKTMLLYVH GYNVTFEKAA RRMVQIVDDL DFKGIPVFYS WPSQGSVGGY PADAASVEWS EQNLGDFLAE AARISGVNTL YLLAHSMGNR ALTGAFLDLV REKPHLKSRF KALLLTAPDI DSEVFRRDIG PGLAASGAAI TLYASGRDRA LRLSKRLHGY PRAGDVDGFP LIVPGIETVD ATHVDTSFLG HSYFNGSRSV LSDMFYILNE ELRAEQRFSL EPVDTPEGRY WRFKE
|
| |