Gene Information |
Locus tag | Caul_2413 |
Symbol | |
ID | 5899868 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2634996 |
End bp | 2636561 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641562904 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001684038 |
Protein GI | 167646375 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3507] Beta-xylosidase |
TIGRFAM ID | |
|
|
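The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch like the one below. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the record does not state either convention, although both are consistent with its reported lengths. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2634996
end_bp = 2636561


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1566
print(protein_length(length_bp))  # 521
```

Both values agree with the Gene Length (1566 bp) and Protein Length (521 aa) fields.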
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.333908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.419565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGAAAC CGTTCCGCCG CCTGACGCTG GCCGTGGCCG CCGGCCTGGG CTGTCTGGGC GCCGCCGCCA CGGCTCAGGG CCAGGTGTGG CGCGCCGACA GCGGGCAGGG GACCTATCAG AACCCGCCGC TCTACGCCGA CTATCCCGAT CCCGACATCA TCCGGGTGGG TGAGGACTTC TACTTCGCCT CGACGACCTT CGTGAACGCG CCGGGCCTGA CGATCCTGCA CTCGCGGGAC CTGGTGAACT GGGACATCGC AAGCCATGTC ATGCCGCGTC TGGAGGGCAA CCCGAAGTAC GACCTCCGCG AGGGCGGCGA CTATCGCCAC GGCCTGTTCG CGCCCAGCCT TCGTCACCAC AACGGCCGCT TCTACATCGC CGTCACGCCG GTGGGCCACC CCACCCGGAT CTATTCGGCC GCCGACATCC GAGGGTCCTG GACCGTGCAC GAACTCGACC GCGAGGCGTT CGACCCAGGC CTGTTCTTCG ACAAGGACGG CAGGGCGTTC ATCGTCACCT CGGTCGGCTC GGATGGGACG ATCCGCCTCC TGACCCTCAA CGCAGACCTC ACGGCCGTCA CCGGCGAGCA GAAGATCCAC TATGTGAAGG GCGCCGAGGG CTCCAAGCTG ATCCGGCGCG GCGACTGGTA CTATCTGTTC AATTCCATTC CACGACGCTT GGCCCTGACG GTGTCGCGCG CCAAGATCCT CACGGGTCTG TGGGAGACCC GCGAACAGAT CGACGACACC ACCGGCGGCC ATCAGGGCGC CCTGGTCGAT CTGCCGGGTG GCGGCTGGTA CGGCTTTGTC ATGCGCGACG CGGGCGCGAT TGGCCGGGTC ACCAATATCA GTCCGGTGTT CTGGCGCGAC GACTGGCCCG TTTGGGGCAC GCCCGACGCG CCAGGCCGGG TTCCCGACCG CGCCGCCAAG CCGATCTTGG GCAAGCCTTT CGTCGAGCCA CCCAGCTCGG ACGATTTCAA GGGGCGCGCG CTTGGCCGGC AATGGCAGTG GAACCATAAC CCCGAAACCA GCCGCTGGTC GCTCAGCGCG CGGCCCGGTT TCCTGCGGCT CCAGGCGACA AAAAGCGCCG ACTTCTGGAC AGCTCGCAAC ACCCTGATCC AGAAAGGGCA GGGACCCAGG AGCCGCGCTG TCGTCAAGCT CGACGTCAGG GCCTTGGCGC CGGGCGACGC CTGCGGTTTT GGAACGTTCG GCAAGTTCTC CAATCAGCTT GTTGTGACGC GCGCGCCCGG CGGCCGGGGC GCGGTGAGCG CGCGGGTCGT GGAAAGCACC GAGACCGGCC CGGCGACCAC GCCGCGCGGC GAAGCGCGCG CCATCCCCCT GCGGAACCTC TGGCTTTCGG TCGACATGGA CTTTAGCGCA GACAAGGCCG CCCTGGCCTA CAGTCTCGAC GGCAGGGCCT GGACGGCGAT GCCGGGTGAT TTCCCGCTGG CCTTCGCCTG GCGCACCGGC ACCTTCCAGG GCGAGCAGTT CGGCCTCTTC TGCTACAATC CCGCCGGCGG CGCCGGGCGC CTGGATGTCG ACAGTTTCAC CCTGAGCAAA CCCTAG
|
Protein sequence | MSKPFRRLTL AVAAGLGCLG AAATAQGQVW RADSGQGTYQ NPPLYADYPD PDIIRVGEDF YFASTTFVNA PGLTILHSRD LVNWDIASHV MPRLEGNPKY DLREGGDYRH GLFAPSLRHH NGRFYIAVTP VGHPTRIYSA ADIRGSWTVH ELDREAFDPG LFFDKDGRAF IVTSVGSDGT IRLLTLNADL TAVTGEQKIH YVKGAEGSKL IRRGDWYYLF NSIPRRLALT VSRAKILTGL WETREQIDDT TGGHQGALVD LPGGGWYGFV MRDAGAIGRV TNISPVFWRD DWPVWGTPDA PGRVPDRAAK PILGKPFVEP PSSDDFKGRA LGRQWQWNHN PETSRWSLSA RPGFLRLQAT KSADFWTARN TLIQKGQGPR SRAVVKLDVR ALAPGDACGF GTFGKFSNQL VVTRAPGGRG AVSARVVEST ETGPATTPRG EARAIPLRNL WLSVDMDFSA DKAALAYSLD GRAWTAMPGD FPLAFAWRTG TFQGEQFGLF CYNPAGGAGR LDVDSFTLSK P
|
| |
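The sequence fields can be cross-checked against the GC content and protein sequence reported above. The following is a rough sketch, not the database's own method: it uses the standard codon assignments, which translation table 11 shares for elongation, and it does not handle alternative start codons (this gene begins with ATG, so that case does not arise here). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as grouped in the record.
print(translate("ATGTCGAAAC CGTTCCGCCG CCTGACGCTG"))  # MSKPFRRLTL
```

Passing the full Gene sequence field to `translate` and `gc_content` would allow a comparison with the Protein sequence and GC content fields.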