Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2531 |
Symbol | |
ID | 4569721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 2901190 |
End bp | 2902791 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639767096 |
Product | hypothetical protein |
Protein accession | YP_912943 |
Protein GI | 119358299 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000854266 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTACTG TAAAAAGCAC CTGGGCAACA AGCCCTATCG TTGCAAACGG AATCCTGTCA GCACCCGAAT GGGCTGCAGC AGGAGTAATG CCGATTCCTG CCGGATTTAT GATGGTCAAA AATGACAACA CGTTTCTCTA TGTTGCGCTC GATATGGTCG GTGATTCAGG TGCCGACCCG GGTGTGGGTG ATTATTTCTG GTTCAGCATT GACTGTAACG GTAATGGAGC CATCACTCCC GGAGTGGATG TCAACTATAG CATCTATCCC ATGCTTCCTG TACGTATCGG GCGTCAGTAC TATCTCGGAC CGGGCAGATG GACAGGACTG CAAAACACGC CAAGCCCTGC TATCGCAACC AATGGATTCG GACCTTCGCC CCGCTCCCGC ACGTCGCACC GTATCTGGGA ACTGCGTATC CCCCTTTCTG AAATCGGGAT TACAGACCTT GGCTCACACC CTGATCCTGC CGTGCATTTC GGATTGAGAG TAGCATCCCG GAATCCTGTA TTCACCTTTG ATTTTCCGGC AGGCTTCTAT TCGAACTTCT CAAACCTGCA TGCTATTCTG CTGGCAAAAA CACCAACGTT CGCTCCCGGT GTTGCCGGGC CGGTGATAGG GTCTGTGGGT CTGATTCCGG CAACGTCACC ACAAATCAAT GGCGGCTATG CAACAACGGC TCCCTCATAT TATCTGCATG TCGATGAAGC CGCATTCGGC GGAACGCTCA CCATAATCGG AAACCGTACC ACCATGCAAA GCCTTTGGGC CGCAGGAGCA CGAAAATACC GTATCCTCCA CCGCGCAGGA AGCAGCGGAG CCTTCACCCC GCTTCTTCAA AGCTGGAATA ATTACCGTTG GAACGGGGCA ACCAGTACTC TTGAAAGCAT CGCCTGGGAC AGCCTCCAGA TGTATCCGCT GCCGGACCCG ACAAAAGATT ACTCCATTGA TGATCTGCTG ATACAATGGA ACACCATCGG TTCAACAGGC ATTCACGAAC TGAAAGCGGA ATTTTTCAGA GACGATCCCG CGCGCACCCC TGTTGCCTCA GCTCCGCAAA CACTTCAGCT CATGATTGAC AACAATATCC CGTATACAGA CATTATCAAC GTGCTGCATG ACGGTGCTCC TGTTGCAGCA TGTGCAATGG AAACCATGAC CAGCGCAACT GACGGAGTGC AGATAACAAT TACGGTGACG GATCTCGAAG CCCATCTGAA AGACTTCACA CTCAAGGCCC ACTGGGGAAA CGGAGAATCG ACTACCGTTT ATTCGGACAA CTATCCTGCA CACCGTAATC CGCTGCATCA GTGGAGCGGC GTTACCTCAC TGGTTGTGCC CCCTGCTGAA TGGGTTCCGC CGCGCACCTG CGCCTACCAG TTCAGACTTT CGGCAACAGC GAGAGTAACC AACGGCTACA CGTATATCGG GTATGTGGAA GACGCCTATC ATGTAACCCT TATCAAACCG GGTAGCCCCG CTCCAAGAGC AATCAAATTG ACCGAAAGTA TTCTGCCGTT CGGGCAACTG TCAGGAGAAG CCGCCCCTGA AATCGGTATT GAACCGAAAA AACTGGGAAT AGAAACGTTC CCGGTACGGT AG
|
Protein sequence | MATVKSTWAT SPIVANGILS APEWAAAGVM PIPAGFMMVK NDNTFLYVAL DMVGDSGADP GVGDYFWFSI DCNGNGAITP GVDVNYSIYP MLPVRIGRQY YLGPGRWTGL QNTPSPAIAT NGFGPSPRSR TSHRIWELRI PLSEIGITDL GSHPDPAVHF GLRVASRNPV FTFDFPAGFY SNFSNLHAIL LAKTPTFAPG VAGPVIGSVG LIPATSPQIN GGYATTAPSY YLHVDEAAFG GTLTIIGNRT TMQSLWAAGA RKYRILHRAG SSGAFTPLLQ SWNNYRWNGA TSTLESIAWD SLQMYPLPDP TKDYSIDDLL IQWNTIGSTG IHELKAEFFR DDPARTPVAS APQTLQLMID NNIPYTDIIN VLHDGAPVAA CAMETMTSAT DGVQITITVT DLEAHLKDFT LKAHWGNGES TTVYSDNYPA HRNPLHQWSG VTSLVVPPAE WVPPRTCAYQ FRLSATARVT NGYTYIGYVE DAYHVTLIKP GSPAPRAIKL TESILPFGQL SGEAAPEIGI EPKKLGIETF PVR
|
| |