Gene Csal_1167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1167 
Symbol 
ID4028106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1333425 
End bp1334537 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content68% 
IMG OID637966344 
Productbiotin synthase 
Protein accessionYP_573222 
Protein GI92113294 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.893176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCCC AGTCCCGCGA CCCTGCCTGG ACCGATGCGT CGCCGACCTT CCAGCCGACG 
ATGCGCCACG ACTGGTCCCT GGAGGAGATC GAAGCGCTGT TCGCGCTGCC CTTCAACGAC
TTGCTGTTCC GTGCCCAGCA GGTCCACCGC GCGCACTTCG ATCCCAACGC CGTGCAGGTC
TCGACCCTGC TGTCGATCAA GACCGGCGCC TGCCCCGAGG ACTGCAAGTA CTGCCCGCAA
TCCGGGCACT ACAACACCGG TCTCGGCAAG GAGAAGCTGC TCGAGATCGA GAAGGTGGTG
GAGCAGGCCC GTGCCGCCAA GGCGGCCGGT GCCAGCCGCT TCTGCATGGG CGCCGCCTGG
CGCAGCCCGC GGGAGAAGGA TCTGCGGGTG GTGACGGAGA TGGTCGGCCG GGTCAAGGCG
CTGGGGCTGG AGACCTGCAT GACGCTCGGC ATGGTCGACG TCGATCAGGC CAGGCGCCTC
GCCGAGGCCG GGCTCGACTA CTACAACCAC AACCTGGATA CCTCGCCGGA CTACTACGGC
GAGATCATCA CCACCCGCAC CTATGCCGAC CGCCTGGAGA CGCTCGCCAA CGTGCGCGAA
GCGGGCATGA AGGTCTGCTC CGGCGGCATC CTGGGCATGG GCGAGGCACC TCGCGATCGC
GCCGCCCTGC TCCAGCAGCT GGTACGCCTG GATCCGCATC CCGAGTCGGT GCCGATCAAC
ATGCTGGTCA AGGTGCCGGG CACCCCGATG GAAAACGTCG AGGACATGGA CCCGCTGACG
TTCATTCGCG CCATCGCCGT GGCCCGCATT CTGATGCCCA AGAGCCACGT GCGCCTGTCC
GCCGGGCGCG AGCAGATGGA CGAGTCGACC CAGGCCCTGG CCTTCCTGGC CGGCGCCAAC
TCGATCTTCT ACGGCGACAC CCTGCTGACC ACCGGCAACC CCCAGGTGGA GCGCGACCGG
GCACTGTTCG ACAAGCTCGG CCTGCATCCC GAACCCAGCG ACCCGCATGC GGACGACGCC
CACCGTGACG ACGAACAGGC CGAGATCGCG CTGGCCCATG CCATTCAGCG CCAGCGTGAC
GACGCCCTTT TCTACGACGC CACCCGGGGC TGA
 
Protein sequence
MTAQSRDPAW TDASPTFQPT MRHDWSLEEI EALFALPFND LLFRAQQVHR AHFDPNAVQV 
STLLSIKTGA CPEDCKYCPQ SGHYNTGLGK EKLLEIEKVV EQARAAKAAG ASRFCMGAAW
RSPREKDLRV VTEMVGRVKA LGLETCMTLG MVDVDQARRL AEAGLDYYNH NLDTSPDYYG
EIITTRTYAD RLETLANVRE AGMKVCSGGI LGMGEAPRDR AALLQQLVRL DPHPESVPIN
MLVKVPGTPM ENVEDMDPLT FIRAIAVARI LMPKSHVRLS AGREQMDEST QALAFLAGAN
SIFYGDTLLT TGNPQVERDR ALFDKLGLHP EPSDPHADDA HRDDEQAEIA LAHAIQRQRD
DALFYDATRG