Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0098 |
Symbol | |
ID | 4570665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 114296 |
End bp | 115678 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639764700 |
Product | hypothetical protein |
Protein accession | YP_910592 |
Protein GI | 119355948 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCC ACACTGAACA TGAGTTCAGG GAACCCGACA ATTTATCTAT GGAAATCCTT TTTCTTCTTT TTCTCATTAT TATCAACGGC ATTTTTGCCA TGTCGGAGAT CGCTCTGATT ACGACAAAGC GTACCCGTTT GACAAAACTT GCTGAAAGCG GCGATAAATC TGCGGCTGCC GCCCTGAAAC TCGGCCATGA ACCGACAACC TTTCTCTCCA CCATCCAGAT CGGGATCACC TCGATAGGCA TCCTCAATGG TATTGTCGGC CAAGGAGCTC TTGCCGAACC CTTTGCCGTA TGGCTCGAAT CGCTCGGCAT GGCTCATGAC GCCAGTCACA TTGGGGCAAC CGCCGTTGTC GTCATCTCCA TTACCTACAT CACCATCGTG GTCGGAGAGC TTGTCCCCAA ACGCCTCGGA CAGTTCAATC CCGAAGGGGC CGCTCGACTC TTTGCCCGCC CCATGCTGAT GCTTGCCACC ATAGCCAGAC CGTTTGTCCG TCTGCTTTCA GTCTCGACCG ATACCTTGCT GAGGCTTATG GGAAAAAGCC CGCAGGCCAT GCCAAGCGTG ACCGAAGAGG AGATTCACGC CATGCTTGAA GAGGGATCTG AAGCCGGAGT CATCGAGCAG CAGGAACACG ACATGGTTCG CAACGTCTTC AGACTCGACG ACCGTCAGCT CGGATCGCTT ATGGTACCGA GGGCGGATAT TGTCTATCTC GATATAACCC GACCGCTTGA AGAAAATATC CTGCGAGTGA CGGAATCGGA ACACTCCCGC TTTCCGGTCT GTAATGGAGG AATGCAATCG CTTCTCGGAG TGGTCAATGC CAAACAGTTG CTCTCCCAGA CACTGAAAGG AGGCCTCAAC GACTTCACCT CGCAACTGCA ACCCTATGTC TATGTTCCCG AAACCCTGAC CGGCATGGAG CTGCTCGATC ACTTCAGAAC ATCCGGCACC CAGATGGTCT TTGTTGTCGA CGAATATGGC GAAGTACAGG GTCTGGTAAC CCTGCAGGAC ATGCTCGAAG CGGTAACCGG CGAGTTCGTT CCCCGCAATC TTGAGGATTC CTGGGCGGTT GAACGGGAAG ACGGCTCATG GCTGCTTGAC GGACTGATCC CCGTGCCTGA GCTCAAGGAC AAACTCGAAC TGCTCAGCGT GCCCGAAGAG GATAAGGGGC TCTATCATAC GCTCAGCGGC CTGCTGATGT GGAGCCTCGG ACGGATGCCG CAAACCGGCG ATGTTGCAAC CTGGGAAAAC TGGCGTCTTG AAATCGTTGA CCTTGACGGA AACAGGATTG ATAAAGTACT TGCATCGAAA ATCCCGCCTC CCGATACCAA CGGTCTCAAA CAGAACGGCT CGGCGGCGGG GGAAAAAAAC TGA
|
Protein sequence | MKRHTEHEFR EPDNLSMEIL FLLFLIIING IFAMSEIALI TTKRTRLTKL AESGDKSAAA ALKLGHEPTT FLSTIQIGIT SIGILNGIVG QGALAEPFAV WLESLGMAHD ASHIGATAVV VISITYITIV VGELVPKRLG QFNPEGAARL FARPMLMLAT IARPFVRLLS VSTDTLLRLM GKSPQAMPSV TEEEIHAMLE EGSEAGVIEQ QEHDMVRNVF RLDDRQLGSL MVPRADIVYL DITRPLEENI LRVTESEHSR FPVCNGGMQS LLGVVNAKQL LSQTLKGGLN DFTSQLQPYV YVPETLTGME LLDHFRTSGT QMVFVVDEYG EVQGLVTLQD MLEAVTGEFV PRNLEDSWAV EREDGSWLLD GLIPVPELKD KLELLSVPEE DKGLYHTLSG LLMWSLGRMP QTGDVATWEN WRLEIVDLDG NRIDKVLASK IPPPDTNGLK QNGSAAGEKN
|
| |