Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0371 |
Symbol | |
ID | 6374033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 391883 |
End bp | 393697 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642682890 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_001958819 |
Protein GI | 189499349 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGCAGA AAACAATGCT TTCGCTGCCC TTAAGGGTAA AGCAGACTCT TGCGGTTATG CAGGATAGTC TGGCGGTAGT GTTTTCCGTA TGGCTCGCCT ATTCGCTTCG TTTTGAGGTA TGGCATGTTC CTCAAAAAGC ACAATGGCTG GTATATGGTA TCGCTCTGGC TATTTCTCTG CCTGTTTTTT ATGGCACCAG CCTCTACAAG TCAGTCTTTC GCTTTAGTGA GGCTATGGCA TTTAAGCAGA TAACAAAAGC TGCGGCACTC TATGCGGTGC TGTTTTTTTT TACTCTTGTC TTGTTCAAAA TAGATGGGGT CCCGCGTTCA ATCGGAATAA CGCAGCCAAT AATGCTTTTT CTGCTGCTTC TCGCAAGCAG AGGCGCAGCC CGTTTTTTTC TGAACACCCG CTTGCAGTTG GGTCATTGCA GTACTGCCGA CAAGCGGTTG CTGATCTACG GTGTCGGATC GGCAGGAATT CAGCTCGCGT CAGCCATCGA ACAAACTACC CGCCACCTTC TTATCGGTTT TATCGATGAT GATCCCAAAC TGCATGGACG GATGGTAAAA GGCCTGAAGG TATTTTCTTT TGGCCAGGTG GCCCGGCTTG TAGAACAGGC GACGGTTACC GACATTCTGC TGGCGATACC CTCTGCAAGC CGTTCAAGGC GCAATCAGAT TTTGCAGGCG TTACAGCCGT TTCCCGTGCA CGTGCAGACA CTTCCAAGCC TTGAGGATTT GACAGATGGC AATATTTCAG TCAGAGATGT AAAAGAGATT GAAATTGAAG ACCTGCTTGG CCGTGATCCT GTTTCTTCCG GGCCATCTCT TTTCAGGCGT AACATTACCG GCAAGACCGT AGTTGTAACC GGTGCGGGGG GGAGTATCGG GAGAGAGTTG TGCAGGCAGA TACTTATCGG GTGCGCCGAT AAGCTCATTA TGATTGATCA TGCTGAATTC AATCTGCATG ACGCATATCT CGAGCTGGAA GGGTACAGAG AAAGAAAACA GCAGGACACG GAGATTGTGC CTCTTTTGTG CAATGTGGCC GAGAACCATC GATTCAGCGC AATCTGCTCT TCATATCACC CGCATACCAT TTACCATGCT GCGGCGTACA AACATGTTCC CATGGTTGAG CGCAATCCTG TCGAAGGCGT TCGGAATAAT GTTTTCGGGA CTCTTCGTTC AGCTCTTGCC GCGCAGGAAT ACGGCGTTGA AAGTTTTATC CTGGTCAGTA CCGATAAGGC TGTTCGTCCA GCAAATTTCA TGGGGGCGAG CAAGCGATTG TGTGAGATTA TTCTTCAGGC GCTATCTGCT GAAGAGAACG GTAGCGGCAC GTGTTTCTCT ATGGTCCGTT TTGGTAATGT GCTGGATTCA AGTGGTTCTG TAGTGCCGCT TTTTCGCCAG CAGATAAAAG ACGGCGGGCC TGTAACCGTA ACACATCCGG AAATAACCCG ATATTTTATG ACTATTCCGG AAGCTGTGCA GCTTGTTATT CAGTCAGGCG CGATGTCTCT GGGCGGAGAG GTTTTTGTCC TTGATATGGG TGAGCCTGTC AAGATAATTG ATCTTGCGAA ACGCATGATC GGATTGTCAG GTTTAACGGT ATTGGACAGG AATAACAGAA AAGGTGATAT TCCTATAGAA GTAATCGGCC TGCGTCCGGG GGAAAAGCTC TATGAAGAGC TGTTGATAAG CGGCGATCCT CAGCCGACTG ATCACCCGAG GATATTCAAG GCGCATGAAG AGTTTATTGC ATGGCCGGCA CTTCAGGATG AACTTTCGGA GATGGAGCAG GTTCTTAATA CCTGA
|
Protein sequence | MLQKTMLSLP LRVKQTLAVM QDSLAVVFSV WLAYSLRFEV WHVPQKAQWL VYGIALAISL PVFYGTSLYK SVFRFSEAMA FKQITKAAAL YAVLFFFTLV LFKIDGVPRS IGITQPIMLF LLLLASRGAA RFFLNTRLQL GHCSTADKRL LIYGVGSAGI QLASAIEQTT RHLLIGFIDD DPKLHGRMVK GLKVFSFGQV ARLVEQATVT DILLAIPSAS RSRRNQILQA LQPFPVHVQT LPSLEDLTDG NISVRDVKEI EIEDLLGRDP VSSGPSLFRR NITGKTVVVT GAGGSIGREL CRQILIGCAD KLIMIDHAEF NLHDAYLELE GYRERKQQDT EIVPLLCNVA ENHRFSAICS SYHPHTIYHA AAYKHVPMVE RNPVEGVRNN VFGTLRSALA AQEYGVESFI LVSTDKAVRP ANFMGASKRL CEIILQALSA EENGSGTCFS MVRFGNVLDS SGSVVPLFRQ QIKDGGPVTV THPEITRYFM TIPEAVQLVI QSGAMSLGGE VFVLDMGEPV KIIDLAKRMI GLSGLTVLDR NNRKGDIPIE VIGLRPGEKL YEELLISGDP QPTDHPRIFK AHEEFIAWPA LQDELSEMEQ VLNT
|
| |