Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4597 |
Symbol | |
ID | 8335951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5228922 |
End bp | 5230022 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 644957698 |
Product | biotin synthase |
Protein accession | YP_003115300 |
Protein GI | 256393736 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0502] Biotin synthase and related enzymes |
TIGRFAM ID | [TIGR00433] biotin synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.815308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.006941 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGGACCA TGACCCAGTC CTTTGCGCAC AGCACCCTCG ACGCCCTGCT CTCCGACCTG GTCAGCGCCG CTTTGGACGG CCGGGCGCCG ACCCGCGAGC AGGCCCTCGC GCTGTTGGCG AGCCCTGATG AGGACGTGCT GGACGTCGTC GCCGCGGCCG GGCGCGTGCG GCGGACGTAC TTCGGGAACC GCGTGAAGCT CAACTACCTG GTGAACATGA AGTCCGGGCT CTGCCCTGAG GACTGCTTCT ACTGCTCGCA GCGGCTCGGC AGCGAGGCGG AGATCCTGAA GTACTCCTGG ATCAAGACCG GCGAGGCCGC CGAACTCGCC GCGAAGGCGG TCGGCGCCGG AGCCAAGCGG GTCTGCCTGG TCGCCTCCGG CCGCGGGCCC TCGGACCGCG ACGTCGAGCG TGTCGCCGAC ACCGTCGCCG CGATCAAGGA CGGGACGCCC GACGTCGAGG TCTGCGTCTG CCTCGGTCTG CTCAAGGACG GCCAGGCCGC GCGGCTGGCC GCCGCCGGCG CCGACGCCTA CAGCCACAAC CTCAACACCG CCGAGGAGAA GTACGCCGAC ATCTGCACCA CGCACACCTT CGCCGACCGC GTCAGCACCC TGCAGGACGC CACCGCCGCC GGGCTGTCCC CGTGTTCCGG CGCCATCTTC GGGATGGGGG AGAGCGACGA GGACGTGGTC TCCGTCGCCT TCGCGCTGCG CGACCTGGAC CCGGATTCGG TGCCGGTCAA CTTCCTCATC CCCTTCGAGG GGACCCCGCT CGGCGGGCGA TGGGATCTGA CGCCGGCTCG ATGCCTGCGG ATCCTGGCGC TGTTCCGGTT CGTGTTCCCG GACGTCGAGG TGCGGCTCGC CGGCGGTCGG GAGATCCACC TGCGGACCCA GCAACCGCTC GCGCTGCACC TGGCCAACGC GATCTTCCTC GGCGACTACC TGACCAGCGA GGGGGCGCCG GGCGCCGACG ACCTGGCGAT GATCGCCGAC GCCGGGTTCA GCGTCGAGGG GCGCCAGGAG ACGACGCTGC CGACGGCGCG CGCCGAGCAG GTGGCTTTGC GGCGGCGCGG GGCTGGGACG CAGGTGGCGG CCAACACCTG A
|
Protein sequence | MRTMTQSFAH STLDALLSDL VSAALDGRAP TREQALALLA SPDEDVLDVV AAAGRVRRTY FGNRVKLNYL VNMKSGLCPE DCFYCSQRLG SEAEILKYSW IKTGEAAELA AKAVGAGAKR VCLVASGRGP SDRDVERVAD TVAAIKDGTP DVEVCVCLGL LKDGQAARLA AAGADAYSHN LNTAEEKYAD ICTTHTFADR VSTLQDATAA GLSPCSGAIF GMGESDEDVV SVAFALRDLD PDSVPVNFLI PFEGTPLGGR WDLTPARCLR ILALFRFVFP DVEVRLAGGR EIHLRTQQPL ALHLANAIFL GDYLTSEGAP GADDLAMIAD AGFSVEGRQE TTLPTARAEQ VALRRRGAGT QVAANT
|
| |