Gene Caci_4597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4597 
Symbol 
ID8335951 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5228922 
End bp5230022 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content71% 
IMG OID644957698 
Productbiotin synthase 
Protein accessionYP_003115300 
Protein GI256393736 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.815308 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.006941 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGACCA TGACCCAGTC CTTTGCGCAC AGCACCCTCG ACGCCCTGCT CTCCGACCTG 
GTCAGCGCCG CTTTGGACGG CCGGGCGCCG ACCCGCGAGC AGGCCCTCGC GCTGTTGGCG
AGCCCTGATG AGGACGTGCT GGACGTCGTC GCCGCGGCCG GGCGCGTGCG GCGGACGTAC
TTCGGGAACC GCGTGAAGCT CAACTACCTG GTGAACATGA AGTCCGGGCT CTGCCCTGAG
GACTGCTTCT ACTGCTCGCA GCGGCTCGGC AGCGAGGCGG AGATCCTGAA GTACTCCTGG
ATCAAGACCG GCGAGGCCGC CGAACTCGCC GCGAAGGCGG TCGGCGCCGG AGCCAAGCGG
GTCTGCCTGG TCGCCTCCGG CCGCGGGCCC TCGGACCGCG ACGTCGAGCG TGTCGCCGAC
ACCGTCGCCG CGATCAAGGA CGGGACGCCC GACGTCGAGG TCTGCGTCTG CCTCGGTCTG
CTCAAGGACG GCCAGGCCGC GCGGCTGGCC GCCGCCGGCG CCGACGCCTA CAGCCACAAC
CTCAACACCG CCGAGGAGAA GTACGCCGAC ATCTGCACCA CGCACACCTT CGCCGACCGC
GTCAGCACCC TGCAGGACGC CACCGCCGCC GGGCTGTCCC CGTGTTCCGG CGCCATCTTC
GGGATGGGGG AGAGCGACGA GGACGTGGTC TCCGTCGCCT TCGCGCTGCG CGACCTGGAC
CCGGATTCGG TGCCGGTCAA CTTCCTCATC CCCTTCGAGG GGACCCCGCT CGGCGGGCGA
TGGGATCTGA CGCCGGCTCG ATGCCTGCGG ATCCTGGCGC TGTTCCGGTT CGTGTTCCCG
GACGTCGAGG TGCGGCTCGC CGGCGGTCGG GAGATCCACC TGCGGACCCA GCAACCGCTC
GCGCTGCACC TGGCCAACGC GATCTTCCTC GGCGACTACC TGACCAGCGA GGGGGCGCCG
GGCGCCGACG ACCTGGCGAT GATCGCCGAC GCCGGGTTCA GCGTCGAGGG GCGCCAGGAG
ACGACGCTGC CGACGGCGCG CGCCGAGCAG GTGGCTTTGC GGCGGCGCGG GGCTGGGACG
CAGGTGGCGG CCAACACCTG A
 
Protein sequence
MRTMTQSFAH STLDALLSDL VSAALDGRAP TREQALALLA SPDEDVLDVV AAAGRVRRTY 
FGNRVKLNYL VNMKSGLCPE DCFYCSQRLG SEAEILKYSW IKTGEAAELA AKAVGAGAKR
VCLVASGRGP SDRDVERVAD TVAAIKDGTP DVEVCVCLGL LKDGQAARLA AAGADAYSHN
LNTAEEKYAD ICTTHTFADR VSTLQDATAA GLSPCSGAIF GMGESDEDVV SVAFALRDLD
PDSVPVNFLI PFEGTPLGGR WDLTPARCLR ILALFRFVFP DVEVRLAGGR EIHLRTQQPL
ALHLANAIFL GDYLTSEGAP GADDLAMIAD AGFSVEGRQE TTLPTARAEQ VALRRRGAGT
QVAANT