Gene Information |
Locus tag | Caul_4625 |
Symbol | |
ID | 5902087 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5004610 |
End bp | 5005632 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641565144 |
Product | biotin synthase |
Protein accession | YP_001686243 |
Protein GI | 167648580 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
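The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the reported 1023 bp gene length is consistent with that) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the "Start bp" and "End bp" fields of this record.
length_bp = gene_length(5004610, 5005632)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the two results agree with the "Gene Length" and "Protein Length" fields of the record.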
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 0.970025 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCAGA TCAACGCCCA GCTCGCCCAT GAACCCCGCC ACGACTGGAC GCTTCCCCAG GTGGAAGCCC TGTTCGACCT GCCCTTCATG GAGCTGATGT TCCAGGCCGC CACCGTGCAC CGGGCCTGGT TCGACCCGTC GGAACTGCAG CTGTCGCAGC TGCTGTCGGT CAAGACCGGC GGCTGCGCCG AGAACTGCGG CTATTGCAGT CAGTCGGCCC ACTTCAAGAC CGGCCTGAAG GCCGAGAAGC TGATGGACGC CGAGGTGGTG ATCGCCAAGG CCCGCGAGGC CCGAGACGGC GGCGCCCAGC GCTTCTGCAT GGGCGCGGCC TGGCGCGAGC TGAAGGACCG CGACCTGCCC AAGCTGGCCG CCATGATCGG CGGCGTGAAG GCCCTGGGCC TGGAAACCTG CGCCACCCTG GGCATGCTGA CCGCGGAACA GGCCAAGCAG CTCAAGGACG CAGGGCTCGA CTACTACAAC CACAACCTCG ACACCGGCCC GGAATATTAC GGCGACGTGG TGTCGACCCG CACCTACCAA GAGCGCCTCG ACACCCTGGC CTACGTCCGC GACGCCGGCA TGAGCACCTG CTGCGGCGGC ATCGTCGGCA TGGGCGAAAC CCGCCGCGAC CGCGCCAGCC TGCTGCATCA GTTGGCCACC CTGCCCAGCC ATCCCGACAG CCTGCCGGTC AACGCCCTGG TGCCGGTGGC CGGTACGCCG CTGGGCGACA AGGTCAAGCG CGAGGGCGAG ATCGACGGGC TGGAGTTCGT GCGCACCGTG GCGGTGGCCC GGATCGTCTG CCCCAAATCC ATGGTCCGCC TCTCGGCCGG CCGCGACGAC ATGAGCCGCG AGCTGCAGGC CCTGTGCTTC ATGGCCGGCG CCAACTCGAT CTTCGTCGGC GGCAAGCTGC TGACCACCCC GCTGCCGAAC ATGGACGACG ACAGCAAGCT GTTCCTCGAC CTGAACATGC GCCCGATGGG CTCGGCCAAG ATTGTGGCGC CCGAGAGCGT CGCGGCAGAG TAA
Protein sequence | MTQINAQLAH EPRHDWTLPQ VEALFDLPFM ELMFQAATVH RAWFDPSELQ LSQLLSVKTG GCAENCGYCS QSAHFKTGLK AEKLMDAEVV IAKAREARDG GAQRFCMGAA WRELKDRDLP KLAAMIGGVK ALGLETCATL GMLTAEQAKQ LKDAGLDYYN HNLDTGPEYY GDVVSTRTYQ ERLDTLAYVR DAGMSTCCGG IVGMGETRRD RASLLHQLAT LPSHPDSLPV NALVPVAGTP LGDKVKREGE IDGLEFVRTV AVARIVCPKS MVRLSAGRDD MSRELQALCF MAGANSIFVG GKLLTTPLPN MDDDSKLFLD LNMRPMGSAK IVAPESVAAE
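The gene and protein sequences above can be checked against each other, and against the "GC content" field, with a short script. This is a sketch only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it assumes the spaces in the sequences are display grouping. Translation table 11 assigns amino acids to codons the same way as the standard code and differs in the permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (the layout NCBI uses for its tables).
# Table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the grouping whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are NOT converted to M here.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence, copied from the record.
prefix = "ATGACGCAGA TCAACGCCCA GCTCGCCCAT"
print(translate(prefix))
print(round(gc_fraction(prefix), 2))
```

Passing the full gene sequence to `translate` and removing the grouping spaces from the protein sequence row allows a direct string comparison. `gc_fraction` on the full sequence can be compared with the reported 68%, bearing in mind that the record does not state how that figure was rounded.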