Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4050 |
Symbol | |
ID | 5901512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 4383984 |
End bp | 4384844 |
Gene Length | 861 bp |
Protein Length | 286 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641564571 |
Product | HAD family hydrolase |
Protein accession | YP_001685673 |
Protein GI | 167648010 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0647] Predicted sugar phosphatases of the HAD superfamily |
TIGRFAM ID | [TIGR01459] HAD-superfamily class IIA hydrolase, TIGR01459 [TIGR01460] Haloacid Dehalogenase Superfamily Class (subfamily) IIA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.706573 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC TGCCTTCTGG CCTTTCCGCC CTCGCCGACC GCTACGACGT GCTCCTCTGC GATGTCTGGG GGGTGATCCA CAACGGGGTC GAGAGCTTCC CCCAGGCCTG CCAGGCCCTG GTCGAATGGC GGACCCACCA CGGGCCGGTG ATCCTGATCT CCAACTCGCC CCGGCCCTCG GCCGCCGTGG TCGAGCAGCT GGACCGGCTG GGCGTGCCGC GCCAGGCCTG GAGCGCCTTC GTCACCTCCG GCGACGCGAC GCGCACCCTG CTGGCCGCCC GTGCGCCCGG CCCGGCCTGG ATCGTCGGCC CCGAGCGCGA CTTCACGCTC TATGAGGGCC TGGACCTGGA GACCGCCGGT CCGGACGACG CGGCCTTCGT CGCCGTCACG GGCATGGTCG ATGACGAGAA CGAGGTCCCC GACGACTATC GGGGTCGCCT GGCCGTCGCC GCCGAGCGCG GCCTGACCCT GATCTGCGCC AATCCCGACC GCGTCGTGCA GCGCGGCAGC CGGCTGATCT ATTGCGGCGG CGCTCTGGCC GACCTCTATG AGAGCCTGGG CGGCGAGGTG TTGATGGCCG GCAAGCCCTA TGGTCCGATC TACGACCTGG CGCTGGCGGA AGCCGAGGCC CTGAAGGGCG GACCGGTCGA TCGCTCGCGC GTGCTGTGCA TCGGGGACGG GGTGATCACC GACGTCAAGG GCGCACAGGA CCAGAATCTG GCCTGCCTGT TCATCGCCAA GGGCATTCAC GGCGAGGCCG CCGTCGGCGC CGACGGCAAG CTCGATCCAG CCGGGGTCGA GGCCCTGCTG GCGGCCGAAA GCGTCGGCGC GACCCACGCC ATGAGCGATC TGGTCTGGTA G
|
Protein sequence | MTDLPSGLSA LADRYDVLLC DVWGVIHNGV ESFPQACQAL VEWRTHHGPV ILISNSPRPS AAVVEQLDRL GVPRQAWSAF VTSGDATRTL LAARAPGPAW IVGPERDFTL YEGLDLETAG PDDAAFVAVT GMVDDENEVP DDYRGRLAVA AERGLTLICA NPDRVVQRGS RLIYCGGALA DLYESLGGEV LMAGKPYGPI YDLALAEAEA LKGGPVDRSR VLCIGDGVIT DVKGAQDQNL ACLFIAKGIH GEAAVGADGK LDPAGVEALL AAESVGATHA MSDLVW
|
| |