Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0889 |
Symbol | |
ID | 4570503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1012543 |
End bp | 1014303 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 639765484 |
Product | polysaccharide biosynthesis protein |
Protein accession | YP_911361 |
Protein GI | 119356717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATC AGAGCAATGC ACATAAGAAA CTTGCCGGAA ATGCGTTATC CGGTATGGTT GCTATTGTCA TTTATATGGT AAGCCGTATT CTTTTGACTC CCTATATCCT GCACTATCTC TCGCTGACTG AATTCGGATT GTGGTCTCTC TCTTTTATTA TTCTTTCTTA TGCAGGGATG GGCGGATTCG GGGTTAACAG TACCTATATC CGTTATTCTG CACGGTATCT CGCTGATGGC AAGCAGAGTG AAATAAGTAA TCTTCTTTCG ACCGGCATAG CTTATATGTT ATCCTTTAGT TTGCTTTTTT GCTCGGTACT TTATTTTTTG ATGCCTTTTA TTCTCCGGCA ATTTCATATA GAACCTTCAC AACAGGAACT TGCTTCCACC ATATTTCTTG GAACAGCACT TGTATTCAGT CTTGAACTGA CGTTAGGCGG GCTTGCATTT ATCATCAATG GAATGCATGA GTTTGCAAAG GAAAAAATAA TCTCAACAAT TGCCGGGCTT TTTGAAATTG TATTTATTCT TCTCTTTCTT GCTCTTGGAG CAGGGGTCAA GGGATTACTT TATGCTTTTG CCTTAAGGAT TGTCATGTCA ACGATCCTCT GCTGGAAAGT TGCCCGCAGC CTGTTGCCAT CGTTAACGAT ATCATGGAAA CTGGTAACTC GTGAACACTT TCGCCACTTT ACAGGGTTTG GCGGGAAAGT CCAGGTGCTT GGTATTATTG GCATCTTTCT TACAGCGATG GACAGAATGT TTATTACCGC TATTTTGGGA CTTGCGTCCG GAGGTATGTT TGAACTTGGC CGAAAGCTGC CCTCTACTGC CGGAGGTATT GCCAATTCGG CATTTGCCCC GTTTTTATCT ACAGCAGCAC ATCTTGAAGG CTCATGGGCG GGTGAAATGA ACAATACGGT GGGAGACAGA ATTAAAACCT ATCTCATCAT TTCGATAATG GCCATTCTTT TTGCCATTAT TCCCGTTGTT TTTTTGCCGG GTTTTCAGAA ATATCTTCCG ATATCACCGG TTTTCATAGC TTCAGCAGTT GCCATGGTGT TTTTCTATCT GTTTTTTCAG CTTCAACACG AACAGAAAAA AAATAATTTC CTTGACAATC AGGAATTAAA AGAACTGTAT CTCAATGGCA TCCGGTTTAC AAATATCATA AGTTCAATAC TTTTTGTTTT TCTGGTTGTT ATGGCTTACC CCCTTATCGA TGCATGGGTT GGTTCAAAAT ATTCAGAGGC TGCAACGATC ATGATTTTTC TTTCTGCAGG ATACGCAGTC CAGCAGTGTA CCGGACCAAT AAACATGATA TTCAGGGGAA TAAACAAGAC AGGAAAAGAA CTGGAATATA TGCTTGTTCA GGTTTTATTG ATGCTGATCT GGATTCCGGC AGCAACAATA ACCTACAGTT TATCAGGTGC TGCTGCCGCA ATAGCATTAA GTTCAATAAC CAGCACCCTG TTTCTTTTTT TGCGAAGCAG TTATATCTTT CAAGTCAGAA TCTGGGAAAT TATTGTTCGA TCTATCCTTC CTTCGCTGGT GTCGTTTTTT CCCGCTTGCC TGATCTACAT CATTACGGTA CTGTTTCCTG TTACAGGGAG GATTGCTGTC ATTGCGCAAA TCCTTGTCTG TGGAGTTCTC TATCTCATCA TGACCATAGC GCTGCTTTGG GGCATTGTTT TGAACGAAGA TGAAAAAAAA CAGGCAATTG CATTATTGCC ATTTAAAAAG AAGATGGACT CATCACAGTG A
|
Protein sequence | MKNQSNAHKK LAGNALSGMV AIVIYMVSRI LLTPYILHYL SLTEFGLWSL SFIILSYAGM GGFGVNSTYI RYSARYLADG KQSEISNLLS TGIAYMLSFS LLFCSVLYFL MPFILRQFHI EPSQQELAST IFLGTALVFS LELTLGGLAF IINGMHEFAK EKIISTIAGL FEIVFILLFL ALGAGVKGLL YAFALRIVMS TILCWKVARS LLPSLTISWK LVTREHFRHF TGFGGKVQVL GIIGIFLTAM DRMFITAILG LASGGMFELG RKLPSTAGGI ANSAFAPFLS TAAHLEGSWA GEMNNTVGDR IKTYLIISIM AILFAIIPVV FLPGFQKYLP ISPVFIASAV AMVFFYLFFQ LQHEQKKNNF LDNQELKELY LNGIRFTNII SSILFVFLVV MAYPLIDAWV GSKYSEAATI MIFLSAGYAV QQCTGPINMI FRGINKTGKE LEYMLVQVLL MLIWIPAATI TYSLSGAAAA IALSSITSTL FLFLRSSYIF QVRIWEIIVR SILPSLVSFF PACLIYIITV LFPVTGRIAV IAQILVCGVL YLIMTIALLW GIVLNEDEKK QAIALLPFKK KMDSSQ
|
| |