Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_1182 |
Symbol | |
ID | 3748216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 1575023 |
End bp | 1576036 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637773716 |
Product | capsular polysaccharide biosynthesis protein I |
Protein accession | YP_379487 |
Protein GI | 78189149 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.139241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTGT TAGTTACAGG CGCTGCTGGC TTTATTGGCT CTACTCTTTG CAAGCGCTTG CTGGAACGCG GCGATCGCGT AACGGGTATT GATAACCTTA ACGATTATTA CGATGTCTCC TTAAAAGAGG CTCGTTTAGC GCAGCTTCAG CCATATGAAA ATTTTACCTT TGTAAAGGGA GATTTAGCCG ATCGCGCTGG TATGGAGGCA CTTTTTGCAA AGGGTGAATT TGAGGGCGTA GTCAACTTGG CAGCACAAGC GGGGGTGCGC TATTCCATTG AAAATCCTCA TTCTTATGTA GAAAGCAATA TTGTTGGATT TTTGCACATT CTTGAAGGAT GTCGCCATCA CGGCGTGAAG CACTTGGTTT ATGCCTCATC CAGCTCAGTG TATGGCGCAA ACGAAACCAT GCCATTTTCG GTACACGACA ACGTTGACCA TCCACTTTCG CTTTACGCTG CCAGCAAAAA AGCCAATGAA TTAATGGCGC ATACCTATAG CCACCTTTAC AACATTCCAA CTACGGGCTT GCGCTTTTTC ACCGTCTATG GACCGTGGGG GCGCCCCGAT ATGGCGCTCT TTTTGTTTAC CGATGCTATT CTAAAAAACA AGCCCATCAA GGTTTTTAAC TACGGCAAGC ACCGTCGCGA CTTCACTTAC ATTGATGATA TTGTGGAAGG CGTTATTCGC ACGCTTGACC ACACAGCAAC GCCAAATCCA GCATGGAGTG GGGCAACACC CGATCCGGGC AGCAGCAAAG CGCCATGGAG AGTTTATAAT ATTGGCAATA GCCAACCAGT GGAGTTGATG GATTACATTC AAGCGCTGGA AAACGAGCTT GGTCGGACGG CAATTAAGGA GTTTTTGCCG CTTCAGCCTG GTGATGTACC CGATACCTAT GCCGATGTTG ATCAGCTTAT TGAAGATGTG CACTACAAAC CGCAAACGAG CGTGCCAGAA GGGGTAAAAC GTTTTGTTGC TTGGTATAAA GAATATTATG GAGTAAAAGG GTAA
|
Protein sequence | MNVLVTGAAG FIGSTLCKRL LERGDRVTGI DNLNDYYDVS LKEARLAQLQ PYENFTFVKG DLADRAGMEA LFAKGEFEGV VNLAAQAGVR YSIENPHSYV ESNIVGFLHI LEGCRHHGVK HLVYASSSSV YGANETMPFS VHDNVDHPLS LYAASKKANE LMAHTYSHLY NIPTTGLRFF TVYGPWGRPD MALFLFTDAI LKNKPIKVFN YGKHRRDFTY IDDIVEGVIR TLDHTATPNP AWSGATPDPG SSKAPWRVYN IGNSQPVELM DYIQALENEL GRTAIKEFLP LQPGDVPDTY ADVDQLIEDV HYKPQTSVPE GVKRFVAWYK EYYGVKG
|
| |