Gene Information |
Locus tag | Caul_2770 |
Symbol | |
ID | 5900225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3008635 |
End bp | 3010602 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641563262 |
Product | rotamase family protein |
Protein accession | YP_001684395 |
Protein GI | 167646732 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
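The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the record for Caul_2770.
length = gene_length_bp(3008635, 3010602)
print(length, protein_length_aa(length))  # 1968 655
```

Under those assumptions the result matches the Gene Length (1968 bp) and Protein Length (655 aa) rows of the record.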
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00828579 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000045664 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTCGCCG GCTTCCGCTC CTTCGCCAAA TCGCCGCTCG CGGTGGTCCT GTTTGGTCTG CTCATCGTCA GCTTCGCGGT GTTCGGCATC AGCGACGTGT TCCGTCACCC GCCCGGCAAA TGGGTGATCG CGGCGGGATC GCGCGACACC ACGCCGGCCG ACTTCAAGGC CCGCTTCGAA AGCTATCGCA AGCAGCAGCA GGCCCAGGGC CAGACCCTCA CGCCAGACCT GGCCGTCGAA CATGGCGTCG ATCGCCAGAT GCTGACCCAG CTGGCGCTGC AGGAATCGCT GGCCGAGCTG ATCCGTAAGA TGGGCGTGCG CCCCTCCGAC AAGCTGGTCG GCGACACCCT GCGCGAGCAA CTGGCCAGCC TGCCCCCCGG CCAGCGTCCG TTCGATCCCA TCAACGGCAA GTTCGATTCC AAGATGTACG CCGCCCTGCT GGCCCAGAAC GATCTGACGC CCGCCCGTTA CGAGGCCTCG CTGCGCGACG AGATCGCCCA GGCGCACCTG TTCAGCGCCG TCGCCACCAG CCTGCGCGCG CCGCGCATCT ATTCGGCCCT GCAGGCCGCC TACGCCCTCG AAGGCCGCGA CCTGGCCGCC TTCGCGATCA ATCCGGCGAC CGTCGAAAAG CCGGCTCCGC CGACCGACGC CCAGCTCCAG GCGTTCATGA AGGAACACGC CGCCGACCTG ACGCGCCCCG AGACCCGGGT GCTGTCGGTC GCCCGCTTCA GCGCCAAGGC GCTGGAAAAT TCGGTGACCG TCCCCGAGGC CGACGTCGTC AAGACCTACA ACTTCCGCAA GGATACCCTG GGCACGGCCG AGACCCGCTC GTTCGTCCAG ATCGTCGCCC CCGACGCCAA GGCCGCCGCG GTGATCGCCC AGCGCCTGGC CAAGGGCGAC CAGCCGGCCG TCGTAGCCAG CGCCTTCGGC AAGCAGCCGG TGTTCATCGA CACCAAGCCC AAGTCGGCCC TGCCCGATCG CAAGGTGGCC GACGCCGTGT TCGCCCTGAC CGCCGGCCAG GTCAGCGCCC CGATCACCGG CGACCTTGGC GTCTCGGTGG TCAAGCTGAC CAAGATCACC CCGGCCGCCA TGCCCTCCCT GGAGTCGCAG CGACCGGCCA TCGAGGCCGA GCTGAAGGCC CAGGCCGCCC AGGCCAAGGC CTATGAGCAG ACCCAGGCCT ATCAGGACGC CCACGACGCC GGCGCCAGCC TGATCGACGC GGCGACCAAG GCCGGCGCCC TGGTGCTGAC CACCTCGCCG ATCGCCGCCA CCGGCGTCGA CCAGACCGGC CAGCCGGTCC CCGGCCTGAC CCCGGACGCG GTGAAGGCCG CCTTCGAACT GCCCTCGGGC GGCGAGAGCG AGCTGATCGA GGCCGGCAAG GGCGAGTATT TCGCCGTCCG CGTCGAGAAG GTCATCCCGT CGGCCATGCC GGCCCTGGCC GAGATCCGCG GCCCCCTGGC CCAGCAATGG ATGGTCACCA AGCTGCTCGA GGCCATGAAG GCCAAGGCCG ACGCCCTGGG CGAGCGCGTC AAGAAGGGCG AGTCGCTGGA GGCCGTCGCC GCCTCGGCCG GAACCAAGGT CCAGCGCGTG CCCAACATAA ACCGCGAGAA CGCCCGCCAA TTCCAGGGCC TGGGTCGCGA CCTGCTGATC GCCACCTTCG GCGCCAAGCC CGGCGTTCCG TTCACGGCGC GCGCGCCGCA GGGCGGCTTC CTGGTCGCGC AGGTCGAGAA GGTCCACCCG GGCGCGCCGA TGCAGATCGC CCAGATCACC CAGGCCATGC GCGGCCAGAC CTCCCAGGGC 
CTGATGCGCG ACGTGGCCGA CTCCGCCCAG GCCGCGGCGA AGGCCCAGCT GAAGACCAAG GTCAACCTGA ACCTGGCCCG CGAGGCGATC GGCGTCGACA CCAGCGCCCT GCCCAAGGAA GAAGGCGGCG GCGGCAAGCC GGCCAAGCCC AAGGGCCAGG CTCAATGA
|
Protein sequence | MLAGFRSFAK SPLAVVLFGL LIVSFAVFGI SDVFRHPPGK WVIAAGSRDT TPADFKARFE SYRKQQQAQG QTLTPDLAVE HGVDRQMLTQ LALQESLAEL IRKMGVRPSD KLVGDTLREQ LASLPPGQRP FDPINGKFDS KMYAALLAQN DLTPARYEAS LRDEIAQAHL FSAVATSLRA PRIYSALQAA YALEGRDLAA FAINPATVEK PAPPTDAQLQ AFMKEHAADL TRPETRVLSV ARFSAKALEN SVTVPEADVV KTYNFRKDTL GTAETRSFVQ IVAPDAKAAA VIAQRLAKGD QPAVVASAFG KQPVFIDTKP KSALPDRKVA DAVFALTAGQ VSAPITGDLG VSVVKLTKIT PAAMPSLESQ RPAIEAELKA QAAQAKAYEQ TQAYQDAHDA GASLIDAATK AGALVLTTSP IAATGVDQTG QPVPGLTPDA VKAAFELPSG GESELIEAGK GEYFAVRVEK VIPSAMPALA EIRGPLAQQW MVTKLLEAMK AKADALGERV KKGESLEAVA ASAGTKVQRV PNINRENARQ FQGLGRDLLI ATFGAKPGVP FTARAPQGGF LVAQVEKVHP GAPMQIAQIT QAMRGQTSQG LMRDVADSAQ AAAKAQLKTK VNLNLAREAI GVDTSALPKE EGGGGKPAKP KGQAQ
|
| |
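The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of how one might strip the spacing, compute GC content, and translate the gene. It builds the standard codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits, which does not matter here because this gene starts with ATG. All function names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
# Map each codon to its one-letter amino acid code ('*' marks a stop codon).
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the gene sequence in this record.
fragment = clean_sequence("ATGCTCGCCG GCTTCCGCTC CTTCGCCAAA")
print(translate(fragment))  # MLAGFRSFAK
print(round(gc_fraction(fragment), 3))
```

The translated fragment agrees with the first block of the protein sequence in the record. The GC fraction printed here describes only this 30 bp fragment, not the 70% reported for the whole gene.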