Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0024 |
Symbol | |
ID | 5897736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 29760 |
End bp | 30674 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641560507 |
Product | CBS domain-containing protein |
Protein accession | YP_001681660 |
Protein GI | 167643997 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.307011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.720082 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCAGCG ACGACCAGAG TTCCGCGCCC GCCGGTCCGG CTCGATTGAG CCGGGGCGTG CGGGCGTTCT TCCGCAAGAT GCGCAAGGAC CTGGCTGGCC GGGGCGTGCC GGGCCTGGCC GAGACCTCGC GCCCGGCCGA CCCAACCCTC GTCCACGAGG TCGACATGGT CGACCAGGCC GAGGCGTTCC AGAGCCTGCG GGTGGCCGAT GTGATGACCC CGCGCGCCGA CATCGTCGCC GTCGAGGCCT CCAGTCCATT CGAGGCGGTG GTCGCCCAGT TCACCGAGGC CGAGCACTCG CGGATGCCGA TCTATCGCGA GACGCTCGAC GACCCGGTGG GGGTGATCCA CGTCAAGGAC GTGTTCCGCC TGCTGGCCGA CGAGGAAAAG CGTCCGACAC CGAGCGACCA GGTGCTGCAT CGCCTGCGTC GCGAGGCGCT GTACGTGCCG GCTTCGATGC GGGCGGCCGA CCTGCTGCTG CGCATGCGCA CCAGCCGCAT CCACATGGCC CTGGTCATCG ACGAGTTCGG CGGCACCGAC GGTCTGGTCA CCATGGAAGA CCTGATCGAG GCGGTGGTCG GCGAGATCGA CGACGAGCAT GACGACGCCC AGGTCTCCAG CATCGTCGCC CGTCCGGGCG GAGTGTTCGA GGCCGACGCC CGCGCGCCGC TGGAGGACCT GGAGGCCGCC CTTGACCGCG ACCTGGCGCC GCCGGACATG GAAGAGGACA TCGATACGGT CGCCGGTCTG GTCGTGGCCC TGGCCGGCCG CGTGCCGCAG CGGGGCGAGG TGATCGCCCA CCCGGCCGGC TTCGACCTGG AGGTCGTCGA GGCCGATCCG CGCCGGGTGC GCCGGGTCCG GGTGCGTCCG GCCGCGACCC CGGAGCCGGC CGCCGGCCGC GTGGCCGCTT CGTGA
|
Protein sequence | MPSDDQSSAP AGPARLSRGV RAFFRKMRKD LAGRGVPGLA ETSRPADPTL VHEVDMVDQA EAFQSLRVAD VMTPRADIVA VEASSPFEAV VAQFTEAEHS RMPIYRETLD DPVGVIHVKD VFRLLADEEK RPTPSDQVLH RLRREALYVP ASMRAADLLL RMRTSRIHMA LVIDEFGGTD GLVTMEDLIE AVVGEIDDEH DDAQVSSIVA RPGGVFEADA RAPLEDLEAA LDRDLAPPDM EEDIDTVAGL VVALAGRVPQ RGEVIAHPAG FDLEVVEADP RRVRRVRVRP AATPEPAAGR VAAS
|
| |