Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2318 |
Symbol | |
ID | 4570956 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2661040 |
End bp | 2662260 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639766879 |
Product | glycosyl transferase, group 1 |
Protein accession | YP_912733 |
Protein GI | 119358089 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0438] Glycosyltransferase |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTTT TATTTGTACA TCAGAACTTT CCCGGACAGT TCCGACATGT TGCCAAAGCC CTGGCCGAGA TGCCGGAACA CCGGGTAGTC GGTATTGGAG AGAGCGCCAA TCTCAAGGGA CGTCTGTCGT TGCATCCCCG GATAAACGTA ATGGGGTATC AGCCGAAAAG GGGTGCGAGT CCAGAGACTC ACCACTATAT CCGTGACTTC GAAGGGGCTG TACGCCGGGG CCAGGAGGTT GCCCGTGTGG CGTTTGAGCT CCGAAAGAAA GGGTTTCGTC CCGATCTGGT GATCAGCCAT CCAGCCTGGG GGGAGTCGTT TTTTCTTCCG GATATTTTTC CGGATGCCCG TCATATCGGT TATTTCGAGT ACTTCTATCG GAGTTCCGGG GGGGATATCG GGTTTGATCC GGAGTTTCCT TCTTCTTTTG ATGATCGGCT GAAGGTAAGG ATTAAAAACA CGACTCAGCT TCTGAGCCTT GATTCTGCCG ATGCAGGAAT TTCGCCTACC CTGTGGCAGC AGAGCCGCTA TCCGAAAGAG TTTCATTCGA AAATCAGGGT AATTCACGAA GGGGTGGATA CCAACGTTGT CGCTCCTGAC GAAAATGCAT CGATTGATAT TGACGGAGCG CATTTCATAA GAGGCGACAG GGTTATTACC TATGTAGCCC GTAATCTGGA ACCTTGTCGA GGGGTTCATG TGTTCATTCG CGCTATTCCA CTGATTCAGG AGCTGTGCCC TGATGCACGG ATTGTTATTA TCGGAGGCGA TGATGTCAGT TACGGGAGAA GACCTACGGC AGGAACAACC TACCGGTCAC TTTATTGTGA CGAAGTGAAA GATGTAGCGG ACTGGTCACG GGTTCATTTT ACCGGCAGGC TGCCCTACAA CCGCTACCTG AAAATTTTAC AGCTCTCTTC AGCTCATGTT TATCTTACCT ATCCTTTTGT GCTCTCCTGG TCGATGCTTG AGGCAATGGC TGCCGGTTGT GTTGTGATCG GTTCAGCGAC GCCCCCTGTT CAGGAGGTCA TTACTCATGC AGAGAATGGC CTGCTGGTTG ACTTTTTCGA CAGGGAAGAA CTTGCTCGTA CGGTTGCCGG AGTAGTCAAT AACCAGTCAC AGCATGAACA GATAAGGCAA TCTGCCCGAC AGACCATACT TGATCGCTAT GATCTGCATA CAAAATGCCT GCCCGAACTG CTGCGGTATC TGATCGGGTA G
|
Protein sequence | MNFLFVHQNF PGQFRHVAKA LAEMPEHRVV GIGESANLKG RLSLHPRINV MGYQPKRGAS PETHHYIRDF EGAVRRGQEV ARVAFELRKK GFRPDLVISH PAWGESFFLP DIFPDARHIG YFEYFYRSSG GDIGFDPEFP SSFDDRLKVR IKNTTQLLSL DSADAGISPT LWQQSRYPKE FHSKIRVIHE GVDTNVVAPD ENASIDIDGA HFIRGDRVIT YVARNLEPCR GVHVFIRAIP LIQELCPDAR IVIIGGDDVS YGRRPTAGTT YRSLYCDEVK DVADWSRVHF TGRLPYNRYL KILQLSSAHV YLTYPFVLSW SMLEAMAAGC VVIGSATPPV QEVITHAENG LLVDFFDREE LARTVAGVVN NQSQHEQIRQ SARQTILDRY DLHTKCLPEL LRYLIG
|
| |