Gene Information |
Locus tag | Caul_1529 |
Symbol | |
ID | 5898984 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1619891 |
End bp | 1621390 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562016 |
Product | lipopolysaccharide biosynthesis protein |
Protein accession | YP_001683157 |
Protein GI | 167645494 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
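The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines. This is a minimal sketch that assumes Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 1619891
end_bp = 1621390
reported_gene_length = 1500     # bp
reported_protein_length = 499   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_gene_length = gene_length(start_bp, end_bp)
computed_protein_length = protein_length(computed_gene_length)

print(computed_gene_length == reported_gene_length)
print(computed_protein_length == reported_protein_length)
```

Under these assumptions, 1621390 - 1619891 + 1 gives 1500 bp, and 1500 / 3 gives 500 codons, of which 499 encode amino acids.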
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.417351 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.787059 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGACGA CGAGCGCCTG GACCAGCGTA CCGCATGACC CGCCGTCGCG TAGCGACTGG TCGGCGCGCG CCCGCTATGC GCCGACCGAC TTCGTCACCC TGCTGTGGCG CGAGCGCTTC TTGATGCTGG GGGTGTTCCT GGCCCTGTTC CTGCTGGGGC TGGCCTTCGC CAGCACGATG AAAAAGACCT ACACCGCCCA GTCCAGTCTG TTCGTCCGCC TGGGCCAGGA ATATGTCTAC GAGCCCCGGG CCGGCGACGC CGCCCGCGGC GCGGTGCCCG ACGTCGACCA GGTGATCCAG TCGGAATCCG AGATCCTGGG CAGCGGCGAA CTGCGCGATC GAGTGATCCG CAAGGTCGGC TTCGCCAGGA TCTTCCCCGA CGCCGCCCAC AAGTACGCCG CCGCCTCGCC GGAGGGGAAG CGCAAGCTGA TCGCCGAGGG GCGCGACGCG GTGGGCCGCA ACCTGAAGAT CGAGACCGCC CCGGACAACT CGATCATCCG GCTGTCCTAT TCCAACGGCG ACGCCGACGT CGCCGAGAAG GTGCTCAACA CCCTGCTCGA AGAGTACCTG ATCTATCGCC GCAGCCTGCT GATCGGGGCC GGCGACAACG GGCTGGAGCG CCAGCGCGAA CTGTTCACGC GCAAGCTGGC CGAGACCGAC ACCGCCTATC AGGCCTTCCT GTCGGGCAAC GACATCGGCG ACTTCACCGC CCAGAAGACC GCCCTGACCC AGCTGCAGGC CCAGGCCGAG TCCCAGAAAT ACGCCACCGA GGCCCAACTG CAGGACCGCA TGGGCCGCCT GGCCTCGGTC CAGGCCGAGC TGGCCCGCAC CCCGGCCGAC ACCGTGCTCT ATCGCGACAG CGACATGTCG GCGTCCGGCA AGCTGGCCCA ACTGAAGCTG GATCGCGAGG GCCTGCTGTC GCGCTACCGC CCCGACGCCC AACCGGTCCG CGACATCGAG GCCCAGATCG TCCAGCTGGA GCAGGGCGTC GCCTCGGGCC GCACCGCCGG CGACGGCGCG CGCCGCAGCG GCCCCAATCC GATCTGGCAG ACCCTGCAGT CCACCCGCAA CGACCTTTCG GCCGAGGTCG CGGCCCTGCG ACAGTCGCTG GTCGCCTACA CCCAGCAGGT TCAGGACGTG AACCAGCGCC TGATGCGCCT GTCGGCGCTG GAGCCGACCT TCAACCAGCT CAGCCGCGAC CGCGATGTGC TGTCGTCCAA CGTCCGCGAC TTCACGGTCA AGGAACAGCA GGACGAGGCC CAGCGGCAGA TGTCGGCCGA GGGCAGCGAC AATATCCGCA TCGTCCAGCG CGCCGTGGCG CCGTCGACCG GCAAGAGCCT GAAAAAGCCG ATCATCGTCC TGGCCTTCCT GTTCGCGGCC TTCACCGCCG CCTGCGCCGG CCTGGTGCGG ATGCTGCTGC GGCCCGGCCT GCAGACCCCG GCTTCGGCCT CGCGCACCCT GGGCCTGCCG GTCCTGGCCA CGGCCAGCTA CAAGCGCTGA
|
Protein sequence | MSTTSAWTSV PHDPPSRSDW SARARYAPTD FVTLLWRERF LMLGVFLALF LLGLAFASTM KKTYTAQSSL FVRLGQEYVY EPRAGDAARG AVPDVDQVIQ SESEILGSGE LRDRVIRKVG FARIFPDAAH KYAAASPEGK RKLIAEGRDA VGRNLKIETA PDNSIIRLSY SNGDADVAEK VLNTLLEEYL IYRRSLLIGA GDNGLERQRE LFTRKLAETD TAYQAFLSGN DIGDFTAQKT ALTQLQAQAE SQKYATEAQL QDRMGRLASV QAELARTPAD TVLYRDSDMS ASGKLAQLKL DREGLLSRYR PDAQPVRDIE AQIVQLEQGV ASGRTAGDGA RRSGPNPIWQ TLQSTRNDLS AEVAALRQSL VAYTQQVQDV NQRLMRLSAL EPTFNQLSRD RDVLSSNVRD FTVKEQQDEA QRQMSAEGSD NIRIVQRAVA PSTGKSLKKP IIVLAFLFAA FTAACAGLVR MLLRPGLQTP ASASRTLGLP VLATASYKR
|
| |
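The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that, then computes a GC fraction and translates codons using the amino acid assignments of NCBI translation table 11 (which match the standard code; the two tables differ only in permitted start codons). It is an illustration on the first 30 bases only, and the names `clean`, `gc_fraction` and `translate` are hypothetical helpers, not part of any existing tool.

```python
# Codon table in NCBI order (bases T, C, A, G); same amino acids for tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base display blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence listed above.
prefix = clean("ATGTCGACGA CGAGCGCCTG GACCAGCGTA")
peptide = translate(prefix)
gc = gc_fraction(prefix)

print(peptide)       # MSTTSAWTSV
print(round(gc, 3))
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full 1500-base gene sequence to the same functions would be the way to check the reported GC content and the complete 499-residue translation.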