Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_3256 |
Symbol | |
ID | 7267403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | - |
Start bp | 3945235 |
End bp | 3948312 |
Gene Length | 3078 bp |
Protein Length | 1025 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643568077 |
Product | O-antigen polymerase |
Protein accession | YP_002464550 |
Protein GI | 219850117 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3307] Lipid A core - O-antigen ligase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.711295 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTACTG TTGCCCTCCG CCTTATCCTG CTTGTTGCGC TCAGCGCACT GCTCATCGCT GCGCTCGGTG TTGCCGTCTG GGTCGACAAT GCCTCCTTGC GCGGTGTCGC CGAACGTTCG CCTATCACGG CCCCCTTCCC CTACAGCGAT GGCCCGGCGC TGGGGGTGAA CGTCTTTAAC CTCCACCTGG AGCCTGACCC CGTCGCAGTT GACCGGACCT TCGCCCTTGC CCGCGATCTC GGCGCACGCT ATGCTCGTAT GCAGGTACCC TGGGATGACA TCGAGATTCA TAGCCGCGGT GATTTTACCG ACCGCCGTAA CGCTGCGACG ATCGGTGTGG TGTCGAGTTG GGATAAGTAC GACCGCATTG TCGCTGCTGC CGTTGCCCAC AATATCGAGT TGATTATGCG TGTCGACCGG CCTCCCCCGT GGGCCTGCGC CGCTGCATGC TCGACCCCGG AGTTTCAGGC AGGTTTGGCA ATCGATGGCA ACTCGATGGC CCCACCCGAT GATCTGACCG ATTATGCTCG CTTTCTCGCC ATCCTCGTTG AACGTTACCG CGGCCAAGTA CGCTATTTTC AGATTTGGAA TGAGCCGAAT CTCAAGAATG AATGGGGTTG GCAAACACCC AAACCTGCCG ATTTTCTCGC GCTGCTCCGC CTTGCTTACG AAGCGGTGAA GACGGCAAAC CCCGATGCAG TCGTGCTCTT CCCCGGTCTA GCCCCCACCG ACGGTCTCGA TCCGCGGGCG CCAATGACCG AGTTGGAGTA TCTCGATGAA ATCTATCGGC TTGGCGGCGC TGCCTATTTC GATATTATGG CCGCCCAGAA TTATGGCCTC GGCCAGCCCC CCTCTGAACA TCGCTATGTC TTTTTGCGCG GACGCGACAA TTGGCGCTGG GATCGCCCTA TCGATACCCG CAACGATGTG AGCCGGGTGG TGCTGCTGCG CGAGGTGATG GAACGCCACG GCGATCTGGC GACACCGGTC TGGGTGACGG AGTTTGGTTA TAACGCTGCG CCGGATCGGA TTCCGTCTGA GCATCGCTTT GTCTGGGGAC CGCCGGTTGA TGAGCTGACA AAAGGTGCGT ATCTGGTCGC TCAGATTGAG CGCGCCCGCC GCGAATGGCC GTGGATGGGG GTGATGAATG TCTGGATGTT GCGCTACGGC GGCTATGCCG AGCCGCATCC CGATGATCCT ACCCCCTACT TTGCACTGGT GAGCCGCGAT TGGCAGCCCT TGCCTTCGTA TGACATCCTG CAAGATTATG TCAAATCACC GGCAGTCGCT CATGTTGGTG TGCATCGCTG GGATCATCCG GCAGTGACGA CAACTCCCAC CGGCTGGCAG GTGCGATTTG CCGGCACCGG CATCGAGTTG CGTGGCGGCA CGCCTACTGC GGCGCTGCTC GATGGTGTGC CGGTCGCACT AGACGGCAAT GCATTGCGCG GTTTGCCCGA TGCGGTGCAT ACCCTCGAAC TACGCGGCGG CACGCCGCCA ACCGAGTTCT CCGTGGTGCG GAGCTTGCCA TGGCGTCCGC TTGCCGATTA CGGGCCGTTG GTGCTGATTG TGGCTTTAGC GGTGACCGCT GCGCTGACAA TGCGCACGGC CCTGCACGTG CTCGATCATC TGCTGATGCG TTGGCAAACG CTCCCTGCCC ATTGGCGCGA ATGGATCATC TTTCTCGCCC AAGGTCTCAC CCTCGCAGTG GCCTATCGTG CTTCGGCACA ACTCCCGCTC ACGATGCTTG GTCTCCTACC GTTTATCGGT TTAGCTATTG CCTATCCTGC ACTTGCCGTG CGTTGGGTAG CTATCACGGT GCCACTCTAC TTTTTGCCGA AAGGTCTGTT TGACGCTCGG TTTGGCATTC GCGAGAGCGG TATCTATCTG CCACTCCACG AAGTAGTCTT GCTCATTGCC GCTCTTGCGA CGGTGGTGCG CGATCGTTCA CATCTCCTGC ACCTTAATCC TACCATCCTC CGCTCACGAG CCACCCTCAT CGCACTCACA CCTGCATTGC TCGTCTTGAT CGCCGGCGTG TGGGGTGTGC TGATCGCCGA AGCGCGCGGC CCAGCCTTGC GCGAACTACG TTGGATGATC GTCGAACCGT TGCTGTTCGC GGCCTTGCTC TGGTGGCACG AGCGACAAGG CCGCCCCACG CTGATGCCAA CGCTGATCGG TTGGATCGCC GCCGGCGCCG TCTCGGCCCT GGTAGCGATC GCTCAAGCCG GCGGGATCAA TCTTGTACCG TTGTTCGGTA GCAAAATCGG GTATAGCGAA GACCTGATCG CCACCGAAGG TGTGATACGC GCCACCGGTT TCTACGGTCA TCCCAACAAC CTCGGTCTGG CGATGGGGCG GGTTTGGCCG TTGGCGGCAG CGTTGGCGTG GGCTGTATGG CAGCGGCAAC AACGTTGGTT GGCAATGGTG TTGGCATCCT GTGCAATCCT TAGCCTCGCT GCTCTTGGCG TTTCATTTTC CCGTGGTGCA TATCTTGGCG CAATCGTTGC CGGCGGGGTA CTCCTCTTTT TCGCCACACC CCCGCGCTAT CGTTGCCTGT CGCTGATCGC CGGTGGCATT GTCATCGTTC TCGCAGCCGG AGCTAGCCTT ATCATCGGTA TCGAACGGCT TAGTCTCATG ACCGGCAGCA GCACCATTCG CCTCGCCACA TGGCGCGCCG CATTGGCAAT GTTGGTCGAT CATCCGCTGG GGATAGGTCT CGATCAGTTT CTCGTTGTCT ATCCTCGTTA CACCGATCCG GCCCTGACGA ATACCAACGA GATCTACACT GCCCATCCGC ATAACCTGAT ACTCGATCTC CTCTTGCGCG GCGGCCCTCT TCTGCTCATC GGGTTAGGCT GGGCTACGTG GTGTATGATC CGTACCGCAG CTCGCTACCC CACTTTACCA CTCGCCGTTG GGATTACCGC AACAATGGCC GGGGCACTCG CGCATGGATT AGTCGATGCG TTCTATTTCT GGCCCGATTT GGCGATGAGC TTCTGGCTCC TCGTAATGAG TAGCCGAATC GGGCTAAGTT CATCAGCGCT TGCTCAATCA TCGCCGGCCC AAACATAA
|
Protein sequence | MRTVALRLIL LVALSALLIA ALGVAVWVDN ASLRGVAERS PITAPFPYSD GPALGVNVFN LHLEPDPVAV DRTFALARDL GARYARMQVP WDDIEIHSRG DFTDRRNAAT IGVVSSWDKY DRIVAAAVAH NIELIMRVDR PPPWACAAAC STPEFQAGLA IDGNSMAPPD DLTDYARFLA ILVERYRGQV RYFQIWNEPN LKNEWGWQTP KPADFLALLR LAYEAVKTAN PDAVVLFPGL APTDGLDPRA PMTELEYLDE IYRLGGAAYF DIMAAQNYGL GQPPSEHRYV FLRGRDNWRW DRPIDTRNDV SRVVLLREVM ERHGDLATPV WVTEFGYNAA PDRIPSEHRF VWGPPVDELT KGAYLVAQIE RARREWPWMG VMNVWMLRYG GYAEPHPDDP TPYFALVSRD WQPLPSYDIL QDYVKSPAVA HVGVHRWDHP AVTTTPTGWQ VRFAGTGIEL RGGTPTAALL DGVPVALDGN ALRGLPDAVH TLELRGGTPP TEFSVVRSLP WRPLADYGPL VLIVALAVTA ALTMRTALHV LDHLLMRWQT LPAHWREWII FLAQGLTLAV AYRASAQLPL TMLGLLPFIG LAIAYPALAV RWVAITVPLY FLPKGLFDAR FGIRESGIYL PLHEVVLLIA ALATVVRDRS HLLHLNPTIL RSRATLIALT PALLVLIAGV WGVLIAEARG PALRELRWMI VEPLLFAALL WWHERQGRPT LMPTLIGWIA AGAVSALVAI AQAGGINLVP LFGSKIGYSE DLIATEGVIR ATGFYGHPNN LGLAMGRVWP LAAALAWAVW QRQQRWLAMV LASCAILSLA ALGVSFSRGA YLGAIVAGGV LLFFATPPRY RCLSLIAGGI VIVLAAGASL IIGIERLSLM TGSSTIRLAT WRAALAMLVD HPLGIGLDQF LVVYPRYTDP ALTNTNEIYT AHPHNLILDL LLRGGPLLLI GLGWATWCMI RTAARYPTLP LAVGITATMA GALAHGLVDA FYFWPDLAMS FWLLVMSSRI GLSSSALAQS SPAQT
|
| |