Gene Sare_1518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1518 
Symbol 
ID5703562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1748406 
End bp1749401 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content70% 
IMG OID641271024 
Productbiotin synthase 
Protein accessionYP_001536405 
Protein GI159037152 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000115411 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCAGAGA TCCTCGACCA GGCCCGCACC CAGGTACTGG AGAACGGCGT CGGCCTCGAC 
GAGGCCGGTG CCCTCGCCGT GCTCAACCTG CCCGACGAGC ACCTGCCCGC CGCTCTCCAA
CTCGCGCACG AGGTACGGAT GCGCTGGTGC GGGCCGGAGG TCGAGGTCGA GGGGATCGTC
TCGCTGAAGA CCGGCGGCTG CCCGGAGGAC TGTCACTTCT GCTCGCAGTC CGGCCTGTTC
ACCTCGCCGG TGCGCGCGGT GTGGCTGGAC ATCCCGTCGC TGGTGGAGGC GGCGAAGCAG
ACCGCCAAGA CCGGCGCGAC CGAGTTCTGC ATCGTGGCCG CCGTGCGCGG CCCGGACGAC
CGGCTGATGC GGCAACTGCG GGAGGGAGTC GCCGCGATCC GCGCCGAGGT CGACATCCAC
GTGGCGGCCT CGGTCGGCAT GCTCACCCAG GAGCAGGTCG ACGAGTTGGT CGAGATGGGC
GTACACCGCT ACAACCACAA CCTGGAGACC TGCCGCTCGT ACTTCCCGAA CGTGGTCACC
ACCCACTCCT GGGAGGAACG CTGGGAGACG CTGCGGATGG TCCGCGCGTC CGGCATGGAG
GTTTGCTGCG GTGGCATCCT CGGGCTGGGG GAGACCGTCG AGCAGCGCGC CGAGTTCGCC
GCCCAGCTCG CCGAGCTGGA CCCGCACGAG GTCCCGCTGA ACTTCCTCAA CCCCCGGCCC
GGCACCCCGC TCGGTGACCG TCCGGTGGTG GAGGGCAAGG ACGCGCTGCG TGCCATCGCC
GCGTTCCGGC TCGCCATGCC ACGCACGATC CTCCGGTACG CCGGTGGCCG CGAGATCACC
CTGGGCGACC TGGGTACCCG TAGCGGCCTG CTCGGCGGCA TCAACGCGGT GATCGTCGGC
AACTACCTGA CCACGCTGGG CCGTCCGGCC ACGACGGACC TGGAACTTCT GGACGACCTG
AAGATGCCGG TCAAGGCACT CTCCGCGACG TTGTGA
 
Protein sequence
MPEILDQART QVLENGVGLD EAGALAVLNL PDEHLPAALQ LAHEVRMRWC GPEVEVEGIV 
SLKTGGCPED CHFCSQSGLF TSPVRAVWLD IPSLVEAAKQ TAKTGATEFC IVAAVRGPDD
RLMRQLREGV AAIRAEVDIH VAASVGMLTQ EQVDELVEMG VHRYNHNLET CRSYFPNVVT
THSWEERWET LRMVRASGME VCCGGILGLG ETVEQRAEFA AQLAELDPHE VPLNFLNPRP
GTPLGDRPVV EGKDALRAIA AFRLAMPRTI LRYAGGREIT LGDLGTRSGL LGGINAVIVG
NYLTTLGRPA TTDLELLDDL KMPVKALSAT L