Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2067 |
Symbol | |
ID | 5899522 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2207303 |
End bp | 2208745 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562556 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001683693 |
Protein GI | 167646030 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5520] O-Glycosyl hydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0826948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATCA AAGTCCCTGT TCGACGCCCA GCCGCGCGCC TCGGCGCGGC GGCCCTGCTG CTAGGTCTGG CGGCCTGCGC GACCACGCCG CCGGTCGTCC CCACCCACGC AGGAGCATGG ATCACCACGG CGGACCGGAG CCAGATGCTG GCGGCGCAGC CCGCCCTGGT CTTCGGGTCA GAGGACGTCG CCCTGGGGCT GCCGGTGATC ACCGTCGACG CCGCGGAACG CCACCAGAGC ATGGTCGGTT TCGGCGCGGC GATCACCGAC GCCTCGGCCT GGCTGATCCA GAACCGGCTG ACGCCCGACC AGCGCGAGCA ATTGCTCAGG GAGCTCTATG GGCGGGGCGA GGGGGAGCTC GGCTTCAGCT TCACGCGCCT GACGATCGGC GCCTCGGATT TCTCGTCCGA ACACTACAGC CTCGACGACG CGCCGGGCGG CGCGGCCGAT CCGGAGCTGG CTCACCTGTC GCTGGGCCGT CCGGCGCAGG CGGTCTTCCC GACCGTCAGG CAGGTCCTGG CGATCAATCC TGATCTCAAG GTCATGGCCT CGCCCTGGAG CGCCCCGGCC TGGATGAAGA CCACCGGCTC GCTGATCAAG GGCCAGCTGA AATCCGAGGC CTATCCGACC TATGCCCGCT TCTTCGTGCG CTATGTTGAC GGGGCGGCCA AGCTGGGCGT GCCGATCGAC TATCTGAGCA TCCAGAACGA GCCGGACTTC GAGCCGGAGA ACTATCCTGG CATGCGCTGG GGCGCGGCCG ATCGGGCGCG GTTCATCGGC GAGAACCTGG GGCCGGCGTT CAAGCAACAT GGCGTGCGGA CCCGGATCCT CGAATGGGAC CACAACTGGG ACCAGCCGCA GCAGCCCCTG ACCGCGCTGG CCGATCCGAA GGCCGCGCCG TTCATCGCCG GCGTGGCCTG GCACTGTTAC GCCGGCGACG TCGCCGCCCA GGCCAAGGTC GCCGGCGCCC ATCCCGACAA GGACGTGTTC TTCACCGAAT GCTCGGGCGG CGACTGGTCG GGTCCGTTCG ACGAAAGCTT CGGCTGGCTG ATGCGCAACC TGGTGATCGG CTCCACCCGC AACGGCGCCC GGGGCGTGCT GATGTGGAAC CTGGCGCTTG ACGAAACCCA CGGACCGCAC AAGGGCGGAT GCGGGGACTG TCGGGGCGTG GTGACCATAG ACAGCCGCAC CGGCGCGATC ACCCGCAACC CCGAATACTA CGCCTTTGGA CACGCCAGCC GGTTCGTGAG GCCGGGCGCC GTACGGATCG ACTCGTCAGA AACCGCGAGC CTGCCGAGCG TCGCCTTCCG CAATCCCGAC GGCGGTCGCG TGCTGGTCGT CTTCAATTCC GGCAAGGATC GCCAGGCGTT TAGCGTCCGC GAGGGCGGTC GGGTCGCCAA GACCTCGCTG CCAGGCGGCG CCGCGGCGAC GTTTGTCTGG TAG
|
Protein sequence | MTIKVPVRRP AARLGAAALL LGLAACATTP PVVPTHAGAW ITTADRSQML AAQPALVFGS EDVALGLPVI TVDAAERHQS MVGFGAAITD ASAWLIQNRL TPDQREQLLR ELYGRGEGEL GFSFTRLTIG ASDFSSEHYS LDDAPGGAAD PELAHLSLGR PAQAVFPTVR QVLAINPDLK VMASPWSAPA WMKTTGSLIK GQLKSEAYPT YARFFVRYVD GAAKLGVPID YLSIQNEPDF EPENYPGMRW GAADRARFIG ENLGPAFKQH GVRTRILEWD HNWDQPQQPL TALADPKAAP FIAGVAWHCY AGDVAAQAKV AGAHPDKDVF FTECSGGDWS GPFDESFGWL MRNLVIGSTR NGARGVLMWN LALDETHGPH KGGCGDCRGV VTIDSRTGAI TRNPEYYAFG HASRFVRPGA VRIDSSETAS LPSVAFRNPD GGRVLVVFNS GKDRQAFSVR EGGRVAKTSL PGGAAATFVW
|
| |