Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2121 |
Symbol | |
ID | 5899576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2284624 |
End bp | 2285730 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641562610 |
Product | hemolysin-type calcium-binding region |
Protein accession | YP_001683747 |
Protein GI | 167646084 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.190728 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.343329 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACA TAAATGGAAC TTCCGGGGAC GATATCGTGT ACCTCGGCAC GACCCTGACT CCGTCGAATA TCTATGACGC CCGGGGCGGT AACGATTATG TGGTCGGCTC GGCTGGCGCC AATCTGATCC TGGGCGGCGA TGGTGACGAC ATCATCGATG GCCGCGGCGG CAGCGACGTG ATCTTCGGCG GCGCTGGCAA CGACGTGGTC TTCGGCGGCA ACGGTTCGGA TCTGCTGTTC GGCGACGCTG GCGCGGACGT GGTCCGCGGC GGCGCGGGCG TCGATATCGT GTCCGGCGGC GCCGGCAACG ACACCTTCGT CTTCCACGCA GGCGACGTCT CCAGCACGTC GCTGCTGCCC GACATGGTTC TCGACTTCCG TGGCGCGGGC AACACCACGG GCGCCGAACA GGACGTCTTG GAACTGCACG GCTTCTCGGC CGGTTCGACG CTGGACTTCG TCGGCAACGT CGGCGACGCG ACCCACCAGC TCTACAAGAT CAACGACGCT GTCGACTCGT CGGCCAGCGG CTACGTGCTG ATCAACACCG ACAGCGCCGC CCACCTGAAC GCCAACGACG TCAAGTTCGT CGCCAAGCCG ACCGAAGCGG TGATCATCGA CTTCGAAGAC ATCGACGCTT CGATCGATCC CTCGATCGCC GCCGGCTATC ACGGCTTCAA CTTCTCCGCC GGCGGCAGCC AGTTGGTTGC GATCGACGTC GACCAATTCC AGAACTCGGG CTACCACACC GCGATCGACG GCGTGACGGG CGCCAGCAAT CCGTTCGCCG TGGATCCGGT GGTCGTGACC CGTTCGGACG GTTCCGACTT TGTCTTCAAC AGCGTCAACA TCGCGGCGGC CTCGGACGCC TCGCAAGTCG TTACGCTCGA AGGCTACAGC GACGGCGTCA AGGTCGGCTC GATCACGACG ACGATCACCT ACGGCCACGC GCTGGAGGTC GACGCCAACT GGGGTCTGAT CGACACCCTG GTCATCGACC GGATTTCCAC CACCGGCGAC TACAGCCCGA CCGACCAGAC CAACGGGTCG CAGTTCGTTC TCGACAACTT CTCGTTTGTC GTCAGCGGCG ACTATCTGCA CGGCTAA
|
Protein sequence | MSNINGTSGD DIVYLGTTLT PSNIYDARGG NDYVVGSAGA NLILGGDGDD IIDGRGGSDV IFGGAGNDVV FGGNGSDLLF GDAGADVVRG GAGVDIVSGG AGNDTFVFHA GDVSSTSLLP DMVLDFRGAG NTTGAEQDVL ELHGFSAGST LDFVGNVGDA THQLYKINDA VDSSASGYVL INTDSAAHLN ANDVKFVAKP TEAVIIDFED IDASIDPSIA AGYHGFNFSA GGSQLVAIDV DQFQNSGYHT AIDGVTGASN PFAVDPVVVT RSDGSDFVFN SVNIAAASDA SQVVTLEGYS DGVKVGSITT TITYGHALEV DANWGLIDTL VIDRISTTGD YSPTDQTNGS QFVLDNFSFV VSGDYLHG
|
| |