Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Clim_1834 |
Symbol | |
ID | 6355175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium limicola DSM 245 |
Kingdom | Bacteria |
Replicon accession | NC_010803 |
Strand | + |
Start bp | 2012750 |
End bp | 2014516 |
Gene Length | 1767 bp |
Protein Length | 588 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642669438 |
Product | glycosyl transferase, WecB/TagA/CpsF family |
Protein accession | YP_001943852 |
Protein GI | 189347323 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1922] Teichoic acid biosynthesis proteins |
TIGRFAM ID | [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATCACA CGGCAGGCAT TGTGCTCGAA AACCGTTCGG GATCTGCAGA AAGAGCGAAA GCCATCATCT TCGGCAGAAA AGAACAAGTC GGCGACATCT GCAAACTGAT CGAAAACAGT TACAGCCAGG TCACCCTTGC CGAAACATCC CGTGAACTCG AAAAATCAGG GCCCGACAAA GAACAATTCA GGCTTGCAGT AATGACCGAC AGTTTTTCCG AAAAACTCAG CATAGGGCTC CTCAACCGCG TCAGACAGAA ATTCGACCCT GAAAACATGA TCTGCCTCTC CGGAGAAATA GAAGAAGACA ACGAAATAAC GCTGCGCTCC GCAGGGCTGA TCTTCCTGGG CAGCTACGGA ACATTTTTCG CCCATGCCGA CAACATCATC AAACACACCC TGAACAACGG CGGCAACAAC GCAATGAAAC CGGAAATCGA GACCGCAAGA AATCTGGAAA AACGCCTCAA AAGAGCGCGA GGCTCAGGAA GAAGCCGAAG AAAACGGTTA TCATTCTTAA TCACATCCTC CATAGCGGAA ACAGCCGCGA GGAGCATCGA ACTCCTGACA GCACTTGCCG TCACCATCAC GCTCTTCATT CCCGTACTGC TTATCCGGCT CCTTATACGC ATCCCATGCG GTCAACCGGT CTTTTCCAGG CGAACGGTCT GCGGCATGGC CGGCCAACCC ATCACTATCC GTACCTTCAG CGACCTCAAG GGACGAATGG CCGATCTTCC GCTCTTCCTC GAACTCTTTA CCGGACGCCT TGCCCTTGCA GGTACCGCAA TCAGAGAGTG GGACGCTCCC GACCCCAATG CCGAACAAGC CTACATCAGC ATGGTCAAAC CCGGCATCAT ATCACTCTGG GACATCCGCC GTACCAGCAA AATCGCGCAC GAAGGACGCG AAGCCATCGA ATGGGAATAT ATCTTCAGCA AACGCCCGGC CTATGACCTG CTGCTTCTGC TCAGAGCACT GCCTGCAATG CTCTACAGCG AAACGACCTC CACATACGAT CCGGTATTCA GGCTGCTCGG ACTTGACATC GACAACATCA CCATGGCTGA AGCGGTCTCG CTCATACAGA CCGACCTCCG CGACAACCGG CAGCAAGCCA TCTATTTCGT CAATCCAGAC TGCCTGAACA AAATGGCCGG AGACAGGGAG TACTGCGAAG TCCTGAAAGA CGGCGACAGC ATATTCCCCG ACGGCATCGG CCTCACCATT GCCGGAAAAC TCCTGCAGAG CCCCCTCAAA GAAAACATCA ACGGCACAGA CATGCTCCCC TATCTCTGCA GGATGGCGGC AGCCGAACGA CACAGCATAT ACCTGCTCGG CGGCAAACCC GGCATAGCCG ACAAAGCCGC AAGCAAAATC AACCGCGAAT TCGGCGTCAC CATCGCAGGC ACCGCCGACG GCTACTTCAA CCACGAAACC GAAACAGGCC GCATCATCGA CGATATAAAC CGCTCCGGAG CCTCCATCCT GCTCGTAGCA TTCGGAGCCC CGCTGCAGGA AAAATGGATC CACCGCCACC GAAACCGGCT CCAACCCGCG CTCCTCATGG GTGTAGGCGG ACTCTTCGAC TTCTACTCGG GCAACGTTCG TCGCGCCCCT CGCTGGATGC GTGAAATCGG CATCGAATGG ATATACAGGA TCATGCAGGA ACCCGGACGG ATGTGGCGTC GCTACGTCAT AGGCAACCCG CTCTTCCTCT ATCGCGTCAT GAAATGGAAA CTCCTAACCG GCAGCGGCAA CCACTGA
|
Protein sequence | MYHTAGIVLE NRSGSAERAK AIIFGRKEQV GDICKLIENS YSQVTLAETS RELEKSGPDK EQFRLAVMTD SFSEKLSIGL LNRVRQKFDP ENMICLSGEI EEDNEITLRS AGLIFLGSYG TFFAHADNII KHTLNNGGNN AMKPEIETAR NLEKRLKRAR GSGRSRRKRL SFLITSSIAE TAARSIELLT ALAVTITLFI PVLLIRLLIR IPCGQPVFSR RTVCGMAGQP ITIRTFSDLK GRMADLPLFL ELFTGRLALA GTAIREWDAP DPNAEQAYIS MVKPGIISLW DIRRTSKIAH EGREAIEWEY IFSKRPAYDL LLLLRALPAM LYSETTSTYD PVFRLLGLDI DNITMAEAVS LIQTDLRDNR QQAIYFVNPD CLNKMAGDRE YCEVLKDGDS IFPDGIGLTI AGKLLQSPLK ENINGTDMLP YLCRMAAAER HSIYLLGGKP GIADKAASKI NREFGVTIAG TADGYFNHET ETGRIIDDIN RSGASILLVA FGAPLQEKWI HRHRNRLQPA LLMGVGGLFD FYSGNVRRAP RWMREIGIEW IYRIMQEPGR MWRRYVIGNP LFLYRVMKWK LLTGSGNH
|
| |