Gene Information |
Locus tag | Caul_5134 |
Symbol | |
ID | 5897360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010335 |
Strand | + |
Start bp | 53506 |
End bp | 54777 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641555237 |
Product | epoxide hydrolase domain-containing protein |
Protein accession | YP_001676568 |
Protein GI | 167621783 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
|
|
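The coordinate and length fields above are consistent with one another, and the relationship can be checked with a few lines of arithmetic. The sketch below assumes 1-based, inclusive coordinates (the usual convention for this kind of record) and a single terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Start bp, End bp).
length = gene_length(53506, 54777)
print(length, protein_length(length))  # prints "1272 423"
```

The result matches the Gene Length (1272 bp) and Protein Length (423 aa) fields of the record.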
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TATCCGGCGC CCTGAACGCG CCCAAGCCCA CACGCCGCCG CCTGCTGACC AGCGCCGCGG GCCTGACCGT TCTGGCCACC GCGACCCACG GCGTCCAAGC CCTGGCCGCG CCCGCCACCG AAGCGGTCGC GCCTTTCAAG GTCCAGGTCG ATCCGGACGT CATCGCCGAT CTGCGACGTC GCTTGACCGC CACGCGGTGG CCCGACGCCG GCGGCGCGGT CGATTGGAGC CAAGGCGTGC CGCTGGCCAA GGCCAAGGCC CTGACCGAGT ACTGGCGGAC AACCTACGAC ATGACGCGGC TGGAACGGCG GCTCAACGCC TTCCCGCAGT TTCGCACGGC GATCGACGGC CTGGGCGTGC ACTTCATCCA CGTCAAATCC AAGCACGCCG ACGCCATGCC GATGATCCTG ACCCATGGTT GGCCAGGTTC GGTCATCGAG TTCCTGGATG TGATCGATCT GCTGACCGAT CCCACGGCGC ATGGCGGCTC GGCCGAGGAC GCCTTCCATG TGGTGATCCC CTCGTTGCCC GGCTACGGCT TCTCCGACAA GCCGGCGGTC CTGGGCTGGG GACTCCCAAA GATCGCCAAG GCTTGGGACA CCCTGATGAA GCGCCTGGGT TATGGCCGCT ACGTGGCCCA GGGCGGGGAC TTGGGCGCTG GCGTCGCCAG CTGGATGTCC AAGCAGGCGC CGCAGGGTCT GGCCGCCATT CACTTGAACC TGCCCATCCT GTTCCCGCCG CCGCCGCCCG GGCCCTCCGG CTACAGCGCC GAGGAGCAAG CGGCCGTGAG CCAGCTGGTG CGCTATGGCT CTGACCTGTC GGCCTACGCC GCCATCCAGG GCACCCGCCC GCAGACGCTC GGCTACGGCC TGGCGGACTC GCCGGTCGGC CAGGCGATGT GGATCTACGA GAAGTTCCAG GCCTGGAGCG ACAACAAGGG CGACCCGGCA GACGCGATCG CCGTCGACAA GATGCTCGAC GACATCATGC TTTACTGGGT GACCGATACG GCCGCCTCGG CCGCGCGCCT CTACAAGGAA AGCTTCTTCA CCGACTTCGC CCGCTTCGAG CTGACCGGGC CGGTCGCTGT GACGATCTTC AAGGGCGACA TCTTCACCCC GCCCAAGAGC TGGGGCGAGC AGACCTACAA GGGCCTGGCC TACTGGAGCG AGCAGGACAA GGGCGGCCAC TTCGCCGCTC TGGAGCAACC GCGCGCCTTC GCCGAGGAAG TCCGCAAGGC CTTCAAGCCC TACCGAGCCT GA
|
Protein sequence | MTDLSGALNA PKPTRRRLLT SAAGLTVLAT ATHGVQALAA PATEAVAPFK VQVDPDVIAD LRRRLTATRW PDAGGAVDWS QGVPLAKAKA LTEYWRTTYD MTRLERRLNA FPQFRTAIDG LGVHFIHVKS KHADAMPMIL THGWPGSVIE FLDVIDLLTD PTAHGGSAED AFHVVIPSLP GYGFSDKPAV LGWGLPKIAK AWDTLMKRLG YGRYVAQGGD LGAGVASWMS KQAPQGLAAI HLNLPILFPP PPPGPSGYSA EEQAAVSQLV RYGSDLSAYA AIQGTRPQTL GYGLADSPVG QAMWIYEKFQ AWSDNKGDPA DAIAVDKMLD DIMLYWVTDT AASAARLYKE SFFTDFARFE LTGPVAVTIF KGDIFTPPKS WGEQTYKGLA YWSEQDKGGH FAALEQPRAF AEEVRKAFKP YRA
|
| |
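The GC content and protein sequence fields can in principle be re-derived from the gene sequence. The following is a minimal sketch using only the Python standard library. It assumes the sequence is pasted as shown above (blocks of 10 separated by spaces) and that the gene starts with ATG. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, so alternative starts such as GTG or TTG are not handled here. `clean`, `gc_content`, and `translate` are hypothetical helper names.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino acid assignments
# are shared by translation tables 1 and 11; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence from the record above.
print(translate("ATGACCGATC TATCCGGCGC C"))  # prints "MTDLSGA"
print(translate("TACCGAGCCT GA"))            # prints "YRA"
```

Passing the full Gene sequence field to `translate` and `gc_content` allows a comparison against the Protein sequence and GC content fields. For routine work, an established library such as Biopython is a more robust choice than a hand-built codon table.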