Gene Caul_4625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4625 
Symbol 
ID5902087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5004610 
End bp5005632 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content68% 
IMG OID641565144 
Productbiotin synthase 
Protein accessionYP_001686243 
Protein GI167648580 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0502] Biotin synthase and related enzymes 
TIGRFAM ID[TIGR00433] biotin synthetase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.970025 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAGA TCAACGCCCA GCTCGCCCAT GAACCCCGCC ACGACTGGAC GCTTCCCCAG 
GTGGAAGCCC TGTTCGACCT GCCCTTCATG GAGCTGATGT TCCAGGCCGC CACCGTGCAC
CGGGCCTGGT TCGACCCGTC GGAACTGCAG CTGTCGCAGC TGCTGTCGGT CAAGACCGGC
GGCTGCGCCG AGAACTGCGG CTATTGCAGT CAGTCGGCCC ACTTCAAGAC CGGCCTGAAG
GCCGAGAAGC TGATGGACGC CGAGGTGGTG ATCGCCAAGG CCCGCGAGGC CCGAGACGGC
GGCGCCCAGC GCTTCTGCAT GGGCGCGGCC TGGCGCGAGC TGAAGGACCG CGACCTGCCC
AAGCTGGCCG CCATGATCGG CGGCGTGAAG GCCCTGGGCC TGGAAACCTG CGCCACCCTG
GGCATGCTGA CCGCGGAACA GGCCAAGCAG CTCAAGGACG CAGGGCTCGA CTACTACAAC
CACAACCTCG ACACCGGCCC GGAATATTAC GGCGACGTGG TGTCGACCCG CACCTACCAA
GAGCGCCTCG ACACCCTGGC CTACGTCCGC GACGCCGGCA TGAGCACCTG CTGCGGCGGC
ATCGTCGGCA TGGGCGAAAC CCGCCGCGAC CGCGCCAGCC TGCTGCATCA GTTGGCCACC
CTGCCCAGCC ATCCCGACAG CCTGCCGGTC AACGCCCTGG TGCCGGTGGC CGGTACGCCG
CTGGGCGACA AGGTCAAGCG CGAGGGCGAG ATCGACGGGC TGGAGTTCGT GCGCACCGTG
GCGGTGGCCC GGATCGTCTG CCCCAAATCC ATGGTCCGCC TCTCGGCCGG CCGCGACGAC
ATGAGCCGCG AGCTGCAGGC CCTGTGCTTC ATGGCCGGCG CCAACTCGAT CTTCGTCGGC
GGCAAGCTGC TGACCACCCC GCTGCCGAAC ATGGACGACG ACAGCAAGCT GTTCCTCGAC
CTGAACATGC GCCCGATGGG CTCGGCCAAG ATTGTGGCGC CCGAGAGCGT CGCGGCAGAG
TAA
 
Protein sequence
MTQINAQLAH EPRHDWTLPQ VEALFDLPFM ELMFQAATVH RAWFDPSELQ LSQLLSVKTG 
GCAENCGYCS QSAHFKTGLK AEKLMDAEVV IAKAREARDG GAQRFCMGAA WRELKDRDLP
KLAAMIGGVK ALGLETCATL GMLTAEQAKQ LKDAGLDYYN HNLDTGPEYY GDVVSTRTYQ
ERLDTLAYVR DAGMSTCCGG IVGMGETRRD RASLLHQLAT LPSHPDSLPV NALVPVAGTP
LGDKVKREGE IDGLEFVRTV AVARIVCPKS MVRLSAGRDD MSRELQALCF MAGANSIFVG
GKLLTTPLPN MDDDSKLFLD LNMRPMGSAK IVAPESVAAE