Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1820 |
Symbol | |
ID | 4571162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2076394 |
End bp | 2078154 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 639766402 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_912260 |
Protein GI | 119357616 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.120135 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCAGA AGGAAAACGC GCTTAAAAAA CTTGCAGGTA ATGCTGTTTC GGGTATGGTG GCGACGATTA TTTATATGGT CAGCCGCCTT CTTCTTACAC CGTTCATTCT GCAGTATCTC TCTCTGGAGG AGTTCGGCTT ATGGTCGCTC TGTTTTATCA TTCTCTCTTA TGCCGGAATG GGAGGGTTCG GAGTAAACAG TACCTATATC CGTTATTCGG CAAGATACCT TGCGGAGGGA AAAGAGAAAG AGATCAGCAA GCTGCTCTCA ACCGGTGTTG CCTATATGTT TTCATTCTGT CTCTTTTTCT GTCTGGTTCT TTATCTGATT ATGCCTTTTC TTCTCGAAAG GTTCCATATA GCGCCTGCGC AGCAGGATCT TGCCTCAACA ATATTTCTTG GTACTGCTGC AGTTTTCAGT CTTGAACTTA CTCTTGGCGG ATTCCGGTTT GTCATTAACG GAATGCATGA GTTTTTAAAG GAGAAAATCG TTTCAACCGT TGCCGGACTC ATTGAGATTG GCGCTATCCT TCTGTTTCTT TACTTCGGAG CGGGGGTTAA AGGGCTCTTG TACGCGTTTG CCTTAAGGCT GGTTCTTGAA ACTATTGGCT GCTGGGCAAT TGCCCGATCC TTGCTTCCTT CGCTCTCTGT TTCATGGAGA TTGATCAGCC GTGAAAATTT CAGGCTTTTT CTCGGTTTTG GCGGCAAAGT CCAGGTGCTC GGCATTCTGG GTATCTTTCT TACGGCGCTT GACAGATTGT TCATTACGGC AATTGCCGGA CTTGCCGCAG GAGGCATGTT TGAGATAGGT CGAAAGCTCC CTTCAACGGC AGGGGGTATC TCATCATCCG CATTCGGTCC GTTTTTATCT ACCGCATCTC ATATCGAAGG CCGTTGGGCA GGTGAAAAAC CGGATGCTTT TCCGGACAGG CTTAAAACCT ATGGTCTTAT TGTTGCAACA ACCGTTACGC TATCCCTTGT CCCGCTTTTT TTTCTACTGC CCGTGCAAAA ACGGCTGCAG GGGGCAAGTC CGCTGATCGC TGTATTTGCA GGGGTTTTAA CCGTTGTTCT GTTTTATCTG CTCAATCGCA GAATGAAAAA TGAAAATTTT CTCGATAACA TTGAATTAAA GCAGCTTTAT CTCAACGGGA TTCGTTTTAC CAACATGATC AACTCGACAC TGTTTCTTTT TCTTGTCGCC ATGGCTCATC CACTGATGAA TGCATGGGTT GGCAAGGAGT ATGCGCGTGC TGCCGATGTT ATGATCTTTT TATCGACAGC CTACTCGATT CAATTGTGTA CAGGTCCGAT AACCATGATA TTTCGGGGAA TTGATCGTAA CGGAAGAGAG CTTGAGTACA TGCTGGTTCA GGTTATACTG ATGGTTATCT GGATTCCTGC CGGAACGATT GCATCGGGAT TGATCGGATC AGCAGCAGCT ATTGCGTGCA GTTCGATAGT CAGCACATGC TTTCTTTTCT GGCGGAGCAA TAACACGTTT CAGATTCGAT TCCGTAAATT TGTTTCAGTC ACCGTTATCC CTGCGCTTGT TCCTCTCTTG CCGGCAGTGG CTGTTTTTGC CGTTTCGGAG ATCTATCCCG CAGAAGGGAG ACTTGTGGCT GTCTTGCAGG TTCTTGTTTG CGGTGTTGTC TATGTGTTGC TTTCTGTCAT GATGTTCTGG AAATTTATCC TGAACGGCGA GGAAAAATCA AAAGCACTGG AAATGATACC TTTTAACCGG AAACGGAATC CTCCATGCTG A
|
Protein sequence | MNQKENALKK LAGNAVSGMV ATIIYMVSRL LLTPFILQYL SLEEFGLWSL CFIILSYAGM GGFGVNSTYI RYSARYLAEG KEKEISKLLS TGVAYMFSFC LFFCLVLYLI MPFLLERFHI APAQQDLAST IFLGTAAVFS LELTLGGFRF VINGMHEFLK EKIVSTVAGL IEIGAILLFL YFGAGVKGLL YAFALRLVLE TIGCWAIARS LLPSLSVSWR LISRENFRLF LGFGGKVQVL GILGIFLTAL DRLFITAIAG LAAGGMFEIG RKLPSTAGGI SSSAFGPFLS TASHIEGRWA GEKPDAFPDR LKTYGLIVAT TVTLSLVPLF FLLPVQKRLQ GASPLIAVFA GVLTVVLFYL LNRRMKNENF LDNIELKQLY LNGIRFTNMI NSTLFLFLVA MAHPLMNAWV GKEYARAADV MIFLSTAYSI QLCTGPITMI FRGIDRNGRE LEYMLVQVIL MVIWIPAGTI ASGLIGSAAA IACSSIVSTC FLFWRSNNTF QIRFRKFVSV TVIPALVPLL PAVAVFAVSE IYPAEGRLVA VLQVLVCGVV YVLLSVMMFW KFILNGEEKS KALEMIPFNR KRNPPC
|
| |