Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2454 |
Symbol | |
ID | 4568882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2828691 |
End bp | 2829581 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639767013 |
Product | hypothetical protein |
Protein accession | YP_912866 |
Protein GI | 119358222 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.842306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGC AGGAGAGCGC CCATAAACAG TTGACCGACG CTATACGGTA TGTTGAGATC CGGTCTCGCC GGATGGCCAG CGAGCTGTTC AGCGGTGAAT ACCACTCCTC ATTCAAGGGA AAAGGGATAG AGTTCAGTCA TGTGCGTGAA TATCAGTATG GCGACGATGT GCGTGCTATT GACTGGAACA CATCTGCCCG CAAGCACGAT CTCTATGTCA AGCAATTTAC GGAAGAACGG GAGCGGACGA TGCTGCTCCT TGTTGACGGC TCGGCATCAA TGCTGTTCGG CAGCAGAAAG CGCAGTAAAC GCGATCTTGC TCTTGAAGTA AGCGCAGTCC TTGCGTACAG CGCCATACAG AATAACGACA AGGTTGGTCT TCTGGTTTTT ACCGACAGGA TAGAAACCTT TATTCCGCCC GCAAAAGGTC GTCGGCAGGT GCTGGTGATT CTTGACGCAT TGTTCAACCT GCAACCGGAA AATCGCAATA CCGATATTAC CGCAGCTCTT TCGTTTGTAC GTTTTACACA GAAACGAAAG GCGATTATTT TTCTTTTGAC CGATCTTGAT GACCGCAACT ACGAGGGGGG GATGAAGCTG TTGAACACTC GCCACGATTT TGTACTTGTG CATCTCAGCG ATCCGCTTGA CAAAGCGCTT CCCCGAACGG GACTTTTGCT GCTTGAGGAT CCTGAAACAC GGAGAAGCAT GATCGTCGAT GCCGGCAGCA GAAAAAAAGT TGAACGCTAC CGGGAAGCGC AGAGCAGGTA TTATGACGAT CTGAGGCAGA GATTGCGGCG TATGAAAATC GATACGGTTT TTCTCGATAC GGATCGTTCC TTTATCAGCG ATCTCAATGC TTTTTTTCGC TATCGTGAGC AGAAAATCTA A
|
Protein sequence | MEKQESAHKQ LTDAIRYVEI RSRRMASELF SGEYHSSFKG KGIEFSHVRE YQYGDDVRAI DWNTSARKHD LYVKQFTEER ERTMLLLVDG SASMLFGSRK RSKRDLALEV SAVLAYSAIQ NNDKVGLLVF TDRIETFIPP AKGRRQVLVI LDALFNLQPE NRNTDITAAL SFVRFTQKRK AIIFLLTDLD DRNYEGGMKL LNTRHDFVLV HLSDPLDKAL PRTGLLLLED PETRRSMIVD AGSRKKVERY REAQSRYYDD LRQRLRRMKI DTVFLDTDRS FISDLNAFFR YREQKI
|
| |