Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2118 |
Symbol | |
ID | 5899573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2281158 |
End bp | 2282489 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641562607 |
Product | capsule polysaccharide biosynthesis protein |
Protein accession | YP_001683744 |
Protein GI | 167646081 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3562] Capsule polysaccharide export protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.107224 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.128982 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGC CGCGTCGCTG CTCCTGGCGC GGACCCCGAC CCGTGGCCCA CACCTCGCCG ATCCGGCCCG GCCGACGGGA AATCCTGTTC CTGCAGGGAC CGCCCGGCTG CTTTTTCAGG GAATTGGCCC GACAGGTCGA GGCCGAGGGG CACGGCGTCC ATCGCATTAA TTTCAACGGC GGCGACGCCC TGGACTGGAG GGGCGGCGGG ATCAATTACC GCGGCGGCGT CGGGGCGTGG CCGAGCTTTT TGGCGCGGTT GCTGCTGGAA CGCGACATCA GCGACGTGGT GCTGTTTGGC GACTGTCGCC CCATCCATCG CGCGGCGCGC GGCGTGGCGG CGGGCCTGGG CCTGACGGTT CATGTCTTCG AGGAAGGTTA TATACGCCCT AATTGGGTGA CCCTGGAGCG AGGCGGGGTC AACGGGTTCT CCACCCTGTC GTCCGATCCG CAATGGTATC TCGACGCCGC CGAGCGGCTG GCGCCCATAC CCGAACACGG TCCCTTGCCG TCGGCGATCG ATCGGCGCGC CCGGGCCAGC GTCGCCTATC ATCTGGCGAC CGTCCTGTCG GCGGCGGCCT TTTCGGGCTA TCGCAACCAT CGGCCCTGGC ACCCGGCGGC CGAGGCCGCG GGCTGGGCCG GGCGTTGGAT CCGGCGGCGC CTGGGCGGCG CAAATCCCGA GCCAAGCCTG GATGGGACGC CCTACTTCCT GCTGCCGCTG CAACTGGACT CCGACTACCA ACTCCGCACG CACTCGGACT ACGAGGGCAT GCAGCCGGCC CTGGCCCAGG TGATCGCCTC GTTCGCCCGC CATGCGCCGG TCAACGCCAG CCTGGTCGTC AAGGAACACC CGTTGGACAA CGGCCTGCGC GACTGGCGGC GCCGGACCTT GGATTATGCG CGCGCCCTCA ACGTGTCCGA CCGCGTGGTG TTTCTCGACA CGGGGGATAT CGACACCCTG GTTGGCGACG CCCAGGGGGT GGTGACCATC AACAGCACCA CCGGGACCCT GGCGCTCGCG GCGGGCGTGC CCGTGGCCAC CCTGGGTCGC GCGATCTACA ATATCGCCGG CCTGACTCAT CGCGGCCCGC TCGACACCTT CTGGCGGACG CTGACCAAGC CCGACCCTCG CCTCTACGAG GCCTTCCGCC GGGTGCTGGC CAGCCGCTGC CTGCTGTGGG GCGGGTTCTA CGACCTGGCG ACGCGTCAGG CCCTGGTCCG GGCGGCGACC GAGCGCATGC TGGGACCGCG ATCCGACACG GTCGCCGCGC CCAAGGGCGC GCCGCCGTTC GTCAGGAATC CGGCGCCGCC AGCCCTGATC GCCGCCGAAT GA
|
Protein sequence | MTAPRRCSWR GPRPVAHTSP IRPGRREILF LQGPPGCFFR ELARQVEAEG HGVHRINFNG GDALDWRGGG INYRGGVGAW PSFLARLLLE RDISDVVLFG DCRPIHRAAR GVAAGLGLTV HVFEEGYIRP NWVTLERGGV NGFSTLSSDP QWYLDAAERL APIPEHGPLP SAIDRRARAS VAYHLATVLS AAAFSGYRNH RPWHPAAEAA GWAGRWIRRR LGGANPEPSL DGTPYFLLPL QLDSDYQLRT HSDYEGMQPA LAQVIASFAR HAPVNASLVV KEHPLDNGLR DWRRRTLDYA RALNVSDRVV FLDTGDIDTL VGDAQGVVTI NSTTGTLALA AGVPVATLGR AIYNIAGLTH RGPLDTFWRT LTKPDPRLYE AFRRVLASRC LLWGGFYDLA TRQALVRAAT ERMLGPRSDT VAAPKGAPPF VRNPAPPALI AAE
|
| |