Gene Information |
Locus tag | Caul_2528 |
Symbol | |
ID | 5899983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2741660 |
End bp | 2744110 |
Gene Length | 2451 bp |
Protein Length | 816 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641563019 |
Product | organic solvent tolerance protein |
Protein accession | YP_001684153 |
Protein GI | 167646490 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1452] Organic solvent tolerance protein OstA |
TIGRFAM ID | |
|
|
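The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length counts the terminal stop codon. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming the CDS includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(2741660, 2744110)
print(length_bp)                  # 2451
print(protein_length(length_bp))  # 816
```

Both results agree with the Gene Length (2451 bp) and Protein Length (816 aa) fields.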
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.150977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGAGT TTCGCTGGGG GGCGAAGGAC CCAAAGGCCG TCGGTATGGG CCGCGCGGTC CTGCTGGCCG GTGCGGCCTG GCTCGCCCTC GCCTGCTCGG CGCAGGCGCA ACAGCCGCTC GCCACCGTTC CGGCCGCTCC GACCCCGTCC CCGGCGGTCG ACGATGGCCT GGGCGACACG GGCTACTATC TCGAATCCGA CCTGCTGATT CGCGACGACG CCAATCAGAA GATGATCGCC CGCGGCGAGG TCGAAGCCCG CTACCAGGGC CGCACCCTGC GGGCCGACGA GGTGGTCTAC GACAGCAAGA CCGAGGTGGT CACCGCCCAC GGGCACGTGC AACTGATCAA CGCCGACGGC ACCGCCCAGT TCGCCGACGA AATGACCATG GACAAGGACA TGAAGGCGGG CTTCGCGCGC GGCTTCTCGG CCCGCCTGGA CAAGAACATC AAGATCGCGG CGGACACCGC CGTGCGTCGC AACGAGCAGA TCACCGAGCT GAACCAGGCG ATCTACACAC CCTGCGAGGT TTGCGCCGAA AAGCCCAAGC CGACCTGGAG CATCCAGGCC GACAAGGTCG TGCAGGACAA GAACCGCCAC CTTGTCTACT ACCACGGCGC GACGATCCGC ATGTGGGGCG CGCCGCTGCT GTACCTGCCG GTGTTCTGGC ATCCCGACCC GCAGACCCAG CGCAGTTCGG GCTTCCTGAC GCCGAAGCTG GGCGTCTCCA AACGGCGCGG CGTCTCCTAT CAGCAGCCCT ATCTGTTCGT GTTCTCGCCG TCTCAGGATT TGGTGCTAAC CCCTCAAATC AATGCGAAAG TTAACCCGTT CCTGAACGCG CAGTACCGCA AGCGCTTCTA CTCCGGCGCC GTCGATGTCC GGGCCGGCGG AACCTACGAC AAGGACTTCG ACAACCACGG CGACCGGTTC GGCAAAGGCA TGTTCAAGAG CTACATCCTC GCCCGCGGTC TGTTCGATAT CGACCAGAAG TGGAAGTGGG GCTTCACCGC CGAGCGGGCT TCGCAGGCGC TGATCTTCGA CGACTACGAC ATCAGCGACG TCTACCAGCA GCGCGGCCAG TTCACGGCCG ACGACCACCG GCTGATGTCG CAGATCTACA CCACGCGGCA GGACAAGCGG TCCTACTTCT CGGCTTCGAT GATTTCGGTG CAGGGTTTGC GGGTTGTGCA GGTGGATCCG GGCACGGGCC TGGCCAACCG GTTCGAGAAC AGCGGCGCCT TCCCCCTGAT CGGCCCCCTG GTCGAGGGGC GCTGGGAGCC GGAATCGCAC ATTCTGGGTG GCCGGCTTCG CGTCCAGGGC TCCGGCGTGG TGCTGACCCG CTCGGAATCC CAGTTTGGCG AGCCGCCCTA CGCCTATGCC GACTACAAGG GCAAGGACGG CGTGGATTCC ACCCGCGGCA CGATCCAGGG CGACTGGCGC GCCAGCGTGG TCCTGGGTTC GGGCCTGCGC GTTGAGCCGT TCGCCCAGGC GCGCGGCGAC ACCTATCGGG TCAAGGACGT CTTTATCCCG GTCAACGCCT TCACCACGGG CGACACCCAC AGCATCAACT CTTCGCGCGG CCTGGGCGTC GCCGGCGTCG ATCTGAGCCT GCCCATGTTC AAGCCGCTGA AGAACGGCGG CAGCATCGTT CTGGAGCCGC TCGCTCAATT CGCCACCGGC TCCAACAGTT CGCGGGTGCC GATCATCGTG GCCCGTGACG CGGCCGGAAA CCCGATCTAT TTCAACGAAG ACAGCACCAA CTTCGAACTC GACGAAACCA ACCTGTTCGA CGTGAACAAG 
TCGCCCGGCT TCGACCTCTA CGAAGGCGGC ACGCGCGTCA ATCTCGGCGG TCGCGCCACG GTCAAGTTCG CTGACGGTCG AGGCGGCAGC GTACTGGTCG GCCGCAGCCT GCGCACCAAG GTCGATCCGC TGATGCCGAC CCGCGCCGGT CTCGACCAGA AGGCTTCCGA CTGGATCGTC GCGGCCACGG TCACACCGAT CCGCGGCGTC AACGCCTTCT CCCGCGCCCG TTTCGACAAC GACACCGGCA AGCTCAACCG GATCGAAGCC GGCGTCGATG CGTCGGTTTC GCGGGGCTTC GGTTCGCTCC GCTACCTGCG CGACAACAAG GACACCTCGG GCTTCCGCCA GGAAAACCTG GACTTCTACG GCGACTACAA GATCCGCGAG CACTGGGGCG TGACCGCCCT GGGCCGCCTA TCGTACCAGG ACGCCCGCGC CTTCGGCCTT CCAGCCGCCG ACAGCCAGTG GTCCTGGACC CGTCGCGACC TGGGCGTCTA CTACAAGGAC GACTGCATCC GCATCGACGT GGTCTATCAG AACGAGGACC GTTACACCCA GACGTCGAGC GGACTGAAAT TGAAGGCCGA CGAGTCCGTG GTGCTGCGCC TAACGCTCGC CACATTAGGC GACACACTGT ACAGCAATTA G
|
Protein sequence | MMEFRWGAKD PKAVGMGRAV LLAGAAWLAL ACSAQAQQPL ATVPAAPTPS PAVDDGLGDT GYYLESDLLI RDDANQKMIA RGEVEARYQG RTLRADEVVY DSKTEVVTAH GHVQLINADG TAQFADEMTM DKDMKAGFAR GFSARLDKNI KIAADTAVRR NEQITELNQA IYTPCEVCAE KPKPTWSIQA DKVVQDKNRH LVYYHGATIR MWGAPLLYLP VFWHPDPQTQ RSSGFLTPKL GVSKRRGVSY QQPYLFVFSP SQDLVLTPQI NAKVNPFLNA QYRKRFYSGA VDVRAGGTYD KDFDNHGDRF GKGMFKSYIL ARGLFDIDQK WKWGFTAERA SQALIFDDYD ISDVYQQRGQ FTADDHRLMS QIYTTRQDKR SYFSASMISV QGLRVVQVDP GTGLANRFEN SGAFPLIGPL VEGRWEPESH ILGGRLRVQG SGVVLTRSES QFGEPPYAYA DYKGKDGVDS TRGTIQGDWR ASVVLGSGLR VEPFAQARGD TYRVKDVFIP VNAFTTGDTH SINSSRGLGV AGVDLSLPMF KPLKNGGSIV LEPLAQFATG SNSSRVPIIV ARDAAGNPIY FNEDSTNFEL DETNLFDVNK SPGFDLYEGG TRVNLGGRAT VKFADGRGGS VLVGRSLRTK VDPLMPTRAG LDQKASDWIV AATVTPIRGV NAFSRARFDN DTGKLNRIEA GVDASVSRGF GSLRYLRDNK DTSGFRQENL DFYGDYKIRE HWGVTALGRL SYQDARAFGL PAADSQWSWT RRDLGVYYKD DCIRIDVVYQ NEDRYTQTSS GLKLKADESV VLRLTLATLG DTLYSN
|
| |
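The sequences above are displayed in space-separated blocks of ten characters. The following minimal sketch shows one way to join such blocks into a contiguous string and compute a GC fraction for comparison with the GC content field. The helper names are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def join_blocks(seq_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = join_blocks("ATGATGGAGT TTCGCTGGGG")
print(len(first_blocks))          # 20
print(gc_fraction(first_blocks))  # 0.55
```

Applied to the complete gene sequence, the same functions would give a value to compare against the reported 65%.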